IN VITRO PEPTIDE AND ANTIBODY DISPLAY LIBRARIES

FIELD OF THE INVENTION 

The invention relates to methods and compositions for
generating and screening combinatorial libraries of (1)
displayed peptides and/or (2) displayed recombinant
single-chain antibodies comprising variable region sequences
encoded by natural or artificial variable region encoding
sequences which are expressed on polysomes in an in vitro
coupled transcription/translation system to facilitate
screening.

BACKGROUND 

Antibody Display and Screening Methods 
Various molecular genetic approaches have been
devised to capture the vast immunological repertoire
represented by the extremely large number of distinct variable
regions which can be present in immunoglobulin chains. The
naturally-occurring germline immunoglobulin heavy chain locus
is composed of separate tandem arrays of variable (V) segment
genes located upstream of a tandem array of diversity (D)
segment genes, which are themselves located upstream of a
tandem array of joining (J) region genes, which are located
upstream of the constant (CH) region genes. During B
lymphocyte development, V-D-J rearrangement occurs wherein a
heavy chain variable region gene (VH) is formed by
rearrangement to form a fused D-J segment followed by
rearrangement with a V segment to form a V-D-J joined product
gene which, if productively rearranged, encodes a functional
variable region (VH) of a heavy chain. Similarly, light chain
loci rearrange one of several V segments with one of several J
segments to form a gene encoding the variable region (VL) of a
light chain.

The vast repertoire of variable regions possible in
immunoglobulins derives in part from the numerous combinatorial
possibilities of joining V and J segments (and, in the case of
heavy chain loci, D segments) during rearrangement in B cell
development. Additional sequence diversity in the heavy chain
variable regions arises from non-uniform rearrangements of the 
D segments during V-D-J joining and from N region addition. 
Further, antigen-selection of specific B cell clones selects 
for higher affinity variants having nongermline mutations in 
one or both of the heavy and light chain variable regions; a 
phenomenon referred to as "affinity maturation" or "affinity 
sharpening". Typically, these "affinity sharpening" mutations 
cluster in specific areas of the variable region, most commonly 
in the complementarity-determining regions (CDRs).

In order to overcome many of the limitations in 
producing and identifying high-affinity immunoglobulins through 
antigen-stimulated B cell development (i.e., immunization), 
various prokaryotic expression systems have been developed that 
can be manipulated to produce combinatorial antibody libraries 
which may be screened for high-affinity antibodies to specific 
antigens. Recent advances in the expression of antibodies in 
Escherichia coli and bacteriophage systems (see, "Alternative 
Peptide Display Methods", infra) have raised the possibility 
that virtually any specificity can be obtained by either 
cloning antibody genes from characterized hybridomas or by de 
novo selection using antibody gene libraries (e.g., from Ig 
cDNA).

Combinatorial libraries of antibodies have been
generated in bacteriophage lambda expression systems which may
be screened as bacteriophage plaques or as colonies of lysogens
(Huse et al. (1989) Science 246: 1275; Caton and Koprowski
(1990) Proc. Natl. Acad. Sci. (U.S.A.) 87: 6450; Mullinax et al.
(1990) Proc. Natl. Acad. Sci. (U.S.A.) 87: 8095; Persson et al.
(1991) Proc. Natl. Acad. Sci. (U.S.A.) 88: 2432).
Unfortunately, lambda-based combinatorial antibody expression
libraries are not suited for screening of large numbers of
library members (i.e., greater than 10^8-10^9 members) nor are
lambda-based combinatorial libraries suitable for [...]

[...] are displayed on the surface of filamentous bacteriophage
(Scott and Smith (1990) Science 249: 386) have proven
attractive for forming various combinations of heavy chain
variable regions and light chain variable regions (and the 
polynucleotide sequences encoding them) for in vitro selection 
and enrichment by binding to specific antigen. Polynucleotide 
sequences encoding heavy and light chain variable regions are
linked to gene fragments that encode signals that direct them
to the periplasmic space of E. coli and the resultant
"antibodies" are displayed on the surface of bacteriophage,
typically as fusions to bacteriophage coat proteins (e.g., pIII
or pVIII). Variable region fragments of immunoglobulins
(either Fv or Fab) can be displayed externally on phage capsids
(phagebodies) and recombinant phage are selected for by binding 
to immobilized antigen. 

Various embodiments of bacteriophage antibody display
libraries and lambda phage expression libraries have been
described (Kang et al. (1991) Proc. Natl. Acad. Sci. (U.S.A.)
88: 4363; Clackson et al. (1991) Nature 352: 624; McCafferty et
al. (1990) Nature 348: 552; Burton et al. (1991) Proc. Natl.
Acad. Sci. (U.S.A.) 88: 10134; Hoogenboom et al. (1991) Nucleic
Acids Res. 19: 4133; Chang et al. (1991) J. Immunol. 147: 3610;
Breitling et al. (1991) Gene 104: 147; Marks et al. (1991) J.
Mol. Biol. 222: 581; Barbas et al. (1992) Proc. Natl. Acad.
Sci. (U.S.A.) 89: 4457; Hawkins and Winter (1992) J. Immunol.
22: 867; Marks et al. (1992) Biotechnology 10: 779; Marks et
al. (1992) J. Biol. Chem. 267: 16007; Lowman et al. (1991)
Biochemistry 30: 10832; Lerner et al. (1992) Science 258: 1313,
incorporated herein by reference).

One particularly advantageous approach has been the
use of so-called single-chain fragment variable (scFv)
libraries (Marks et al. (1992) Biotechnology 10: 779; Winter G
and Milstein C (1991) Nature 349: 293; Clackson et al. (1991)
op. cit.; Marks et al. (1991) J. Mol. Biol. 222: 581; Chaudhary
et al. (1990) Proc. Natl. Acad. Sci. (USA) 87: 1066; Chiswell
et al. (1992) TIBTECH 10: 80; [...]). Various embodiments of
scFv libraries displayed on bacteriophage coat proteins have
been described.

Beginning in 1988, single-chain analogues of Fv
fragments and their fusion proteins have been reliably 
generated by antibody engineering methods. The first step 
generally involves obtaining the genes encoding VH and VL
domains with desired binding properties; these V genes may be
isolated from a specific hybridoma cell line, selected from a
combinatorial V-gene library, or made by V gene synthesis. The
single-chain Fv is formed by connecting the component V genes
with an oligonucleotide that encodes an appropriately designed
linker peptide, such as (Gly-Gly-Gly-Gly-Ser)3 or equivalent
linker peptide(s). The linker bridges the C-terminus of the
first V region and N-terminus of the second, ordered as either
VH-linker-VL or VL-linker-VH. In principle, the scFv binding
site can faithfully replicate both the affinity and specificity
of its parent antibody combining site.
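
The linker arrangement described above can be illustrated with a minimal sketch. It works only at the level of one-letter amino acid strings; the function name `assemble_scfv` and the two short placeholder domain sequences are hypothetical and are not taken from this specification.

```python
"""Sketch: assemble a single-chain Fv amino acid sequence from V domains."""

# (Gly-Gly-Gly-Gly-Ser)3 linker named in the text, in one-letter code.
LINKER = "GGGGS" * 3


def assemble_scfv(vh: str, vl: str, order: str = "VH-VL", linker: str = LINKER) -> str:
    """Join two V-domain sequences through a flexible linker.

    The linker bridges the C-terminus of the first domain and the
    N-terminus of the second, in either of the two orders named in the text.
    """
    if order == "VH-VL":
        first, second = vh, vl
    elif order == "VL-VH":
        first, second = vl, vh
    else:
        raise ValueError("order must be 'VH-VL' or 'VL-VH'")
    return first + linker + second


if __name__ == "__main__":
    # Hypothetical, truncated placeholder domains (illustration only).
    vh_placeholder = "EVQLVESGGG"
    vl_placeholder = "DIQMTQSPSS"
    print(assemble_scfv(vh_placeholder, vl_placeholder))
    print(assemble_scfv(vh_placeholder, vl_placeholder, order="VL-VH"))
```

In practice the assembly is performed at the polynucleotide level with an oligonucleotide encoding the linker; the string model is used here only to make the domain order explicit.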

Thus, scFv fragments are comprised of VH and VL
domains linked into a single polypeptide chain by a flexible
linker peptide. After the scFv genes are assembled, they are
cloned into a phagemid and expressed at the tip of the M13
phage (or similar filamentous bacteriophage) as fusion proteins
with the bacteriophage pIII (gene 3) coat protein. Enriching
for phage expressing an antibody of interest is accomplished by
panning the recombinant phage displaying a population of scFv
for binding to a predetermined epitope (e.g., target antigen,
receptor).

Various methods have been reported for increasing the
combinatorial diversity of a scFv library to broaden the
repertoire of binding species (idiotype spectrum). The use of
PCR has permitted the variable regions to be rapidly cloned
either from a specific hybridoma source or as a gene library
from non-immunized cells, affording combinatorial diversity in
the assortment of VH and VL cassettes which can be combined.
Furthermore, the VH and VL cassettes can themselves be
diversified, such as by random, pseudorandom, or directed
mutagenesis. Typically, VH and VL cassettes are diversified in
or near the complementarity-determining regions (CDRs), often
the third CDR, CDR3. Enzymatic inverse PCR mutagenesis has
been shown to be a simple and reliable method for constructing
relatively large libraries of scFv site-directed mutants
(Stemmer et al. (1993) Biotechniques 14: 256), as has
error-prone PCR and chemical mutagenesis (Deng et al. (1994)
J. Biol. Chem. 269: 9533). Riechmann et al. (1993) Biochemistry
32: 8848 showed semirational design of an antibody scFv fragment
using site-directed randomization by degenerate oligonucleotide 
PCR and subsequent phage display of the resultant scFv mutants. 
Barbas et al. (1992) op. cit. attempted to circumvent the 
problem of limited repertoire sizes resulting from using biased 
variable region sequences by randomizing the sequence in a 
synthetic CDR region of a human tetanus toxoid-binding Fab. 

CDR randomization has the potential to create
approximately 1 x 10^20 CDRs for the heavy chain CDR3 alone, and
a roughly similar number of variants of the heavy chain CDR1
and CDR2, and light chain CDR1-3 variants. Taken individually
or together, the combinatorics of CDR randomization of heavy
and/or light chains requires generating a prohibitive number of
bacteriophage clones to produce a clone library representing
all possible combinations, the vast majority of which will be
non-binding. Generation of such large numbers of primary
transformants is not feasible with current transformation
technology and bacteriophage display systems. For example,
Barbas et al. (1992) op. cit. only generated 5 x 10^7
transformants, which represents only a tiny fraction of the
potential diversity of a library of thoroughly randomized CDRs.
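
The scale of this mismatch can be made concrete with a short calculation. The sketch below assumes that each randomized position may hold any of the 20 naturally-occurring amino acids, so that n positions give 20^n sequences; the CDR lengths tried are illustrative assumptions, since the text states only the approximate total. Under that assumption, a fully randomized stretch of 15 to 16 residues is of the order of 10^20 sequences (20^15 is about 3.3 x 10^19 and 20^16 about 6.6 x 10^20).

```python
"""Sketch: compare theoretical CDR diversity with a transformant count."""

AMINO_ACIDS = 20  # naturally-occurring amino acids allowed per position


def theoretical_diversity(randomized_positions: int) -> int:
    """Number of distinct sequences for fully randomized positions."""
    return AMINO_ACIDS ** randomized_positions


def max_fraction_sampled(library_size: float, randomized_positions: int) -> float:
    """Upper bound on the fraction of sequence space a library can cover.

    Assumes every library member is distinct, which overstates coverage.
    """
    return min(1.0, library_size / theoretical_diversity(randomized_positions))


if __name__ == "__main__":
    transformants = 5e7  # figure quoted in the text for Barbas et al. (1992)
    for n in (5, 10, 15, 16):  # assumed CDR lengths, for illustration only
        diversity = theoretical_diversity(n)
        fraction = max_fraction_sampled(transformants, n)
        print(f"{n:2d} positions: {diversity:.2e} sequences, "
              f"at most {fraction:.2e} sampled")
```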

A further limitation of present bacteriophage scFv
display systems is produced by the constraints of the
prokaryotic systems used to generate the bacteriophage
libraries. For example, prokaryotic in vivo display systems
often suffer from defective secretion, rapid proteolysis,
and/or formation of insoluble inclusion bodies containing the
"displayed" scFv due to various factors, including high level
expression (Mallender WD and Voss EW (1994) J. Biol. Chem. 269:
[...]).

[...] yielded a variety of useful
antibodies and antibody fusion proteins. A bispecific single
chain antibody has been shown to mediate efficient tumor cell
lysis (Gruber et al. (1994) J. Immunol. 152: 5368).
Intracellular expression of an anti-Rev scFv has been shown to
inhibit HIV-1 virus replication in vitro (Duan et al. (1994)
Proc. Natl. Acad. Sci. (USA) 91: 5075), and intracellular
expression of an anti-p21ras scFv has been shown to inhibit
meiotic maturation of Xenopus oocytes (Biocca et al. (1993)
Biochem. Biophys. Res. Commun. 197: 422). Recombinant scFv
which can be used to diagnose HIV infection have also been
reported, demonstrating the diagnostic utility of scFv (Lilley
et al. (1994) J. Immunol. Meth. 171: 211). Fusion proteins
wherein an scFv is linked to a second polypeptide, such as a
toxin or fibrinolytic activator protein, have also been
reported (Holvoet et al. (1992) Eur. J. Biochem. 210: 945;
Nicholls et al. (1993) J. Biol. Chem. 268: 5302).

If it were possible to generate scFv libraries having
broader antibody diversity and overcoming many of the
limitations of a prokaryotic in vivo display system, the number
and quality of scFv antibodies suitable for therapeutic and
diagnostic use could be vastly improved.

Based on the foregoing, it is evident that there is a
need in the art for methods to generate scFv antibody libraries
which comprise a broader diversity and which are not limited by
the fundamental constraints of in vivo display systems. The
present invention fulfills this need and others.

Alternative Peptide Display Methods 
An increasingly important aspect of biopharmaceutical
drug development and molecular biology is the identification of
peptide structures, including the primary amino acid sequences,
of peptides or peptidomimetics that interact with biological
macromolecules. One method of identifying peptides that
possess a desired structure or functional property, such as
binding to a predetermined biological macromolecule (e.g., a
[...] acid sequence of the peptide.

Several approaches to generating and screening large
libraries of random or pseudorandom peptide sequences suitable
for screening, selection, and identification of desired
individual library members have been proposed in the art. One
category of peptide library is produced by direct chemical
synthesis of the library members. One early method involves
the synthesis of peptides on a set of pins or rods, such as is
described in PCT patent publication Nos. 84/03564 and 84/03564.
A similar method involving peptide synthesis on beads, which
forms a peptide library in which each bead is an individual
library member, is described in U.S. Patent [...] and a
related method is described in PCT patent publication No.
92/00091. A significant improvement of the bead-based methods
involves tagging each bead with a unique identifier tag, such
as an oligonucleotide, so as to facilitate identification of
the amino acid sequence of each library member. These improved
bead-based methods are described in PCT publication No.
93/06121.

Another chemical synthesis method involves the
synthesis of arrays of peptides (or peptidomimetics) on a
surface in a manner that places each distinct library member
(e.g., unique peptide sequence) at a discrete, predefined
location in the array. The identity of each library member is
determined by its spatial location in the array. The locations
in the array where binding interactions between a predetermined
molecule (e.g., a receptor) and reactive library members occur
are determined, thereby identifying the sequences of the
reactive library members on the basis of spatial location.
These methods are described in U.S. Patent 5,143,854; PCT
patent publication Nos. 90/15070 and 92/10092; Fodor et al.
(1991) Science 251: 767; and Dower and Fodor (1991) Ann. Rep.
Med. Chem. 26: 271.

In addition to the direct chemical synthesis methods
for generating peptide libraries, several recombinant DNA
methods also have been reported. One type involves the display
of a peptide sequence, antibody, or other protein on the
surface of a bacteriophage particle or cell. Generally, in
these methods each bacteriophage particle or cell serves as an
individual library member displaying a single species of
displayed peptide in addition to the natural bacteriophage or
cell protein sequences. Each bacteriophage or cell contains
the nucleotide sequence information encoding the particular
displayed peptide sequence; thus, the displayed peptide
sequence can be ascertained by nucleotide sequence
determination of an isolated library member.

A well-known peptide display method involves the 
presentation of a peptide sequence on the surface of a 
filamentous bacteriophage, typically as a fusion with a 
bacteriophage coat protein. The bacteriophage library can be 
incubated with an immobilized, predetermined macromolecule or 
small molecule (e.g., a receptor) so that bacteriophage 
particles which present a peptide sequence that binds to the 
immobilized macromolecule can be differentially partitioned 
from those that do not present peptide sequences that bind to 
the predetermined macromolecule. The bacteriophage particles 
(i.e., library members) which are bound to the immobilized 
macromolecule are then recovered and replicated to amplify the 
selected bacteriophage subpopulation for a subsequent round of 
affinity enrichment and phage replication. After several 
rounds of affinity enrichment and phage replication, the 
bacteriophage library members that are thus selected are 
isolated and the nucleotide sequence encoding the displayed 
peptide sequence is determined, thereby identifying the 
sequence(s) of peptides that bind to the predetermined
macromolecule (e.g., receptor). Such methods are further
described in PCT patent publication Nos. 91/17271, 91/18980,
91/19818, and 93/08278.
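
The effect of repeated rounds of affinity enrichment and replication can be sketched with a simple two-population model. The recovery probabilities below are illustrative assumptions, not measured values, and the model assumes that replication amplifies binders and non-binders equally, so that only the selection step changes their proportions.

```python
"""Sketch: binder fraction over rounds of affinity enrichment."""


def enrich(binder_fraction: float, rounds: int,
           binder_recovery: float = 0.1,
           background_recovery: float = 1e-4) -> list:
    """Return the binder fraction before selection and after each round.

    binder_recovery: assumed probability that a binding member is retained
        on the immobilized macromolecule in one round.
    background_recovery: assumed probability that a non-binding member is
        retained nonspecifically in one round.
    """
    history = [binder_fraction]
    fraction = binder_fraction
    for _ in range(rounds):
        retained_binders = fraction * binder_recovery
        retained_background = (1.0 - fraction) * background_recovery
        # Replication is assumed to preserve proportions, so the fraction
        # after amplification equals the fraction among retained members.
        fraction = retained_binders / (retained_binders + retained_background)
        history.append(fraction)
    return history


if __name__ == "__main__":
    for round_number, fraction in enumerate(enrich(1e-6, rounds=4)):
        print(f"round {round_number}: binder fraction {fraction:.3g}")
```

With these assumed values each round multiplies the odds of a member being a binder by the ratio of the two recovery probabilities, which is why a few rounds suffice to bring a rare binder to dominance.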

The latter PCT publication describes a recombinant
DNA method for the display of peptide ligands that involves the
production of a library of fusion proteins with each fusion
protein composed of a first polypeptide portion, typically
comprising a variable sequence, that is available for potential
binding to a predetermined macromolecule, and a second
polypeptide portion that binds to DNA, such as the DNA vector
encoding the individual fusion protein. When transformed host
cells are cultured under conditions that allow for expression 



WO 95/11922 PCT/US94/12206 



of the fusion protein, the fusion protein binds to the DNA
vector encoding it. Upon lysis of the host cell, the fusion
protein/vector DNA complexes can be screened against a
predetermined macromolecule in much the same way as
bacteriophage particles are screened in the phage-based display
system, with the replication and sequencing of the DNA vectors
in the selected fusion protein/vector DNA complexes serving as
the basis for identification of the selected library peptide
sequence(s).

Other systems for generating libraries of peptides
have aspects of both the recombinant and in
vitro chemical synthesis methods. In these hybrid methods,
cell-free enzymatic machinery is employed to accomplish the in
vitro synthesis of the library members (i.e., peptides or
polynucleotides). In one type of method, RNA molecules with
the ability to bind a predetermined protein or a predetermined
dye molecule were selected by alternate rounds of selection and
PCR amplification (Tuerk and Gold (1990) Science 249: 505;
Ellington and Szostak (1990) Nature 346: 818). A similar
technique was used to identify DNA sequences which bind a
predetermined human transcription factor (Thiesen and Bach
(1990) Nucleic Acids Res. 18: 3203; Beaudry and Joyce (1992)
Science 257: 635; PCT patent publication Nos. 92/05258 and
92/14843). In a similar fashion, the technique of in vitro
translation has been used to synthesize proteins of interest
and has been proposed as a method for generating large
libraries of peptides. These methods which rely upon in vitro
translation, generally comprising stabilized polysome
complexes, are described further in PCT patent publication Nos.
88/08453, 90/05785, 90/07003, 91/02076, 91/05058, and 92/02536.
Applicants have described methods in which library members
comprise a fusion protein having a first polypeptide portion
with DNA binding activity and a second polypeptide portion
[...] among others.

Although the various methods described above for
generating and screening peptide libraries have been reported,
there exists a need for additional methods for making peptide
libraries, selecting desired library members, and identifying
the peptide sequence(s) of said desired library members.
Alternative methods which (1) increase the primary peptide
library size, (2) facilitate rapid, efficient, and inexpensive
library construction and screening, or (3) possess other
advantageous features would meet a need in the art for improved
peptide library methods. For instance, some of the in vitro
translation-based methods suffer from the instability of the
polysome complexes, which leads to poor recovery of the nucleic
acids that encode the peptide sequence of interest.
Additionally, polysomes are relatively large and the resultant
slower diffusion in solvent leads to relatively inefficient
capture of polysomes by immobilized ligand/receptor during
screening. The recombinant methods described above can only be
used to produce libraries of compounds composed of subunits and
library members capable of being produced by the host cell, and
thus for example are not suited for producing library members
comprising non-naturally occurring amino acids and peptide
sequences which adversely affect the host cell, among other
sequences. The present invention meets the need for advanced
methods for generating and screening such desirable peptide
libraries, and in one aspect provides libraries of single-chain
antibodies displayed on nascent polysomes.

All publications and patent applications herein are
incorporated by reference to the same extent as if each
individual publication or patent application was specifically
and individually indicated to be incorporated by reference.



SUMMARY OF THE INVENTION

The present invention provides an improved method for
generating libraries of polysomes displaying nascent peptides
suitable for affinity interaction screening. The improvement
[...]
amino acids long or longer, frequently from 5-100 amino acids
long, and often from about 8-15 amino acids long. A library
can comprise library members having varying lengths of
displayed peptide sequence, or may comprise library members
having a fixed length of displayed peptide sequence. Portions
or all of the displayed peptide sequence(s) can be random,
pseudorandom, defined set kernel, fixed, or the like. The
present display methods include methods for in vitro display of
single-chain antibodies, such as nascent scFv, which enable
large-scale screening of scFv libraries having broad diversity
of variable region sequences and binding specificities.

The present invention also provides a method for 
affinity screening a library of polysomes displaying nascent 
peptides (including single-chain antibodies) for library 
members which bind to a predetermined receptor (e.g., a 
mammalian proteinaceous receptor such as, for example, a 
peptidergic hormone receptor, a cell surface receptor, an 
intracellular protein which binds to other protein(s) to form
intracellular protein complexes such as heterodimers and the
like) or epitope (e.g., an immobilized protein, glycoprotein,
oligosaccharide, and the like). An improvement of this method
comprises contacting a preblocking agent with the receptor or
epitope (or immobilized epitope surface or immobilized receptor
surface) prior to and/or concomitant with contacting the
polysome library with the epitope or receptor (or immobilized
epitope surface or receptor surface). Suitable preblocking
agents include casein, nonfat milk, bovine serum albumin, 
gelatin, tRNA, and the like. Optionally, a non-ionic detergent 
(e.g. Tween, NP-40) is included to reduce nonspecific binding. 

The present invention also provides a method for
generating libraries of polysomes displaying nascent
single-chain antibodies. In an embodiment, the method comprises
using a coupled in vitro transcription/translation system to
generate the polysomes from a library of DNA templates. Each
DNA template library member comprises a gene cassette encoding
a VH [...] and may comprise additional terminal
peptide sequences, such as epitope tags, fusion partner
polypeptides, and the like.

The present invention also provides an improved
method for generating libraries of polysomes displaying nascent
peptides. The improvement comprises using a coupled in vitro
transcription/translation system to generate polysomes from a
library of DNA templates; the resultant library of polysomes
represents a range of displayed peptide sequences.

The present invention further provides a method of
screening a library of polysomes displaying (1) nascent
peptides or (2) single-chain antibodies for species having high
binding affinity for a predetermined receptor or epitope
[...]
additional step of placing sequences encoding positively
selected nascent peptides or single-chain antibodies obtained
by screening a polysome library into a bacteriophage display
system for further affinity screening, such as under screening
conditions incompatible with retention of intact polysome
structure. Stated generally, an improvement of the method
comprises a sequential affinity screen process utilizing a
plurality of expression systems, wherein (1) a first expression
system (e.g., a library of in vitro translated polysomes
displaying nascent peptides) is screened for library members
which bind to a predetermined receptor(s) or epitope(s),
thereby selecting library members having substantial binding
affinity for the predetermined receptor(s) or epitope(s); (2)
the displayed peptide sequence(s) in the selected library
members are identified and/or isolated thereby constituting
first-round selected peptide sequences; (3) a second expression
system (e.g., bacteriophage coat protein peptide display or a
second in vitro expression system) comprising a population of
library members which is substantially enriched for the
first-round selected peptide sequences is screened for library
members which bind to the predetermined receptor(s) or
epitope(s), thereby selecting library members having
[...]
constituting subsequent-round selected peptide sequences.

The present invention provides novel methods for
generating and screening single-chain antibody (e.g., scFv)
libraries by in vitro synthetic methods. The single-chain
antibody libraries can be screened to select and identify
individual library members having the ability to bind or
otherwise interact (e.g., such as catalytic antibodies) with a
predetermined macromolecule, such as for example a
proteinaceous receptor, peptide, oligosaccharide, virion, or
other predetermined compound or structure. The individual
library members typically comprise peptides or single-chain
antibodies composed of naturally-occurring amino acids, but in
some embodiments may comprise alternative amino acids, imino
acids, or other building blocks compatible with in vitro
translation systems employing unnatural aminoacyl tRNA species
(see, PCT publication No. WO90/05785). The displayed peptides,
antibodies, peptidomimetic antibodies, and variable region
sequences that are identified from such libraries can be used
for therapeutic, diagnostic, research, and related purposes
(e.g., catalysts, solutes for increasing osmolarity of an
aqueous solution, and the like).

In a method of the invention, a single-chain antibody
library is generated by in vitro synthesis in a cell-free
system, wherein individual library members comprise a nascent
polypeptide comprising a VH domain in polypeptide linkage to a
VL domain, and wherein the nascent polypeptide is linked to a
polynucleotide encoding said nascent polypeptide (or a
polynucleotide complementary to the encoding polynucleotide
sequence), such linkage typically being accomplished by a
ribosome bound on a stalled polysome.

In a method of the invention, a peptide library is
generated by in vitro synthesis in a cell-free system, wherein
individual library members comprise a nascent polypeptide
comprising a first polypeptide portion consisting of a random,
[...]
(or a polynucleotide complementary to the encoding
polynucleotide sequence), such linkage typically being
accomplished by a ribosome bound on a stalled polysome.

Alternatively, the nascent polypeptide may comprise a
first polypeptide portion consisting of a random, pseudorandom,
defined kernel, or predetermined sequence (or combination(s)
thereof) or scFv in polypeptide linkage to a second polypeptide
portion ("tether") linked to a polynucleotide encoding said
nascent polypeptide (or to a polynucleotide complementary to
the encoding polynucleotide sequence). The nascent peptide or
antibody is synthesized as a fusion protein comprising: (1) a
first polypeptide portion, termed the "tether segment",
comprising a polypeptide sequence which binds to the encoding
mRNA molecule serving as the translation template for the
synthesis of the nascent antibody, or to a bound DNA primer or
cDNA copy of such encoding mRNA, either directly or through
binding an intermediate molecule (biotin, digoxigenin, or the
like) that is linked directly to the encoding mRNA or cDNA copy
thereof, and (2) a second polypeptide portion, termed (1) the
"displayed peptide", comprising a random, pseudorandom, defined
kernel, or predetermined sequence (or combination(s) thereof),
or (2) "single-chain antibody", comprising a VH and VL each
having one of a variety of possible amino acid sequence
combinations represented in the library. The tether segment
serves to link the displayed peptide or single-chain antibody
of an individual library member to the polynucleotide
comprising the sequence information encoding the amino acid
sequence of the individual library member's displayed peptide
or VH and VL domains. The linked polynucleotide of a library
member provides the basis for replication of the library member
after a screening or selection procedure, and also provides the
basis for the determination, by nucleotide sequencing, of the
identity of the displayed peptide sequence or VH and VL amino
acid sequence. The displayed peptide(s) or single-chain
[...]
domains will be ligated to polynucleotides encoding constant
regions (CH and CL) to form polynucleotides encoding complete
antibodies (e.g., chimeric or fully-human), antibody fragments,
and the like. Often polynucleotides encoding the isolated CDRs
will be grafted into polynucleotides encoding a suitable
variable region framework (and optionally constant regions) to
form polynucleotides encoding complete antibodies (e.g.,
humanized or fully-human), antibody fragments, and the like.

In one embodiment, the tether segment comprises an
RNA-binding polypeptide sequence that binds to the mRNA serving
as the translation template for the nascent polypeptide.
Typically, the tether segment comprising an RNA-binding
polypeptide sequence has a conserved RNA-binding domain
structure noted in RNA-binding proteins, such as an RNP motif,
an arginine-rich motif (ARM), an RGG box, a KH (hnRNP K
homology) motif, a dsRNA-binding motif, a zinc finger/knuckle,
a cold-shock domain, or combination(s) thereof (see, Burd CG
and Dreyfuss G (1994) Science 265: 615). For example and not
limitation, an RNA-binding tether segment can comprise: (1) an
RNP1 and/or RNP2 consensus sequence (e.g., substantially
identical to KGFGFVXF, RGYAFVXY, LFVGNL, or IYIKGM), (2) an
arginine-rich domain (e.g., TRQARRNRRRRWRERQ,
ALGISYGRKKRRQRRRP, MDAQTRRRERRAEKQAQW, GTAKSRYKARRAELIAER, or
GNAKTRRHERRRKLAIER), (3) an RGG box (e.g., typically at least
2, 3, 4, or 5 RGG sequences), (4) a KH motif, or (5)
combination(s) thereof. Other
RNA-binding sequence motifs known in the art can be employed,
and novel RNA-binding peptide motifs (such as obtained by
directed evolution, screening libraries for RNA-binding
species, and the like) can also be used.
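
As an illustration only, a candidate tether sequence could be checked
computationally for the motif classes listed above. The sketch below is
not part of the disclosed method: the function name find_tether_motifs,
the treatment of X as "any residue", and the use of exact matching in
place of "substantially identical" are all assumptions made for the
example, and the test sequence is hypothetical.

```python
import re

# Consensus sequences quoted in the text. Treating X as "any single
# residue" is an assumption of this sketch.
RNP_CONSENSUS = ["KGFGFVXF", "RGYAFVXY", "LFVGNL", "IYIKGM"]
ARGININE_RICH = [
    "TRQARRNRRRRWRERQ",
    "ALGISYGRKKRRQRRRP",
    "MDAQTRRRERRAEKQAQW",
    "GTAKSRYKARRAELIAER",
    "GNAKTRRHERRRKLAIER",
]


def _to_regex(consensus):
    """Compile a consensus string into a regex, mapping X to any residue."""
    return re.compile(consensus.replace("X", "."))


def find_tether_motifs(tether, min_rgg=2):
    """Return a dict mapping motif class to the matches found in `tether`.

    Matching is exact (no mismatches allowed), which is stricter than the
    "substantially identical" language of the text.
    """
    tether = tether.upper()
    hits = {"RNP": [], "ARM": [], "RGG": []}
    for consensus in RNP_CONSENSUS:
        if _to_regex(consensus).search(tether):
            hits["RNP"].append(consensus)
    for domain in ARGININE_RICH:
        if domain in tether:
            hits["ARM"].append(domain)
    # Count non-overlapping RGG tripeptides; report only if enough are present.
    rgg_count = len(re.findall("RGG", tether))
    if rgg_count >= min_rgg:
        hits["RGG"].append(f"{rgg_count} RGG repeats")
    return hits


# Hypothetical tether sequence assembled for demonstration purposes.
example = "MKGFGFVEFGSRGGRGGALGISYGRKKRRQRRRP"
print(find_tether_motifs(example))
```

A practical screen would more likely use an alignment-based score so
that near matches to a consensus are also reported.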

In an alternative embodiment, the tether segment
comprises an epitope bound by an immunoglobulin which is
covalently linked either to the mRNA serving as the translation
template for the nascent polypeptide or to a cDNA copy thereof. 

In another embodiment, the tether segment comprises a
streptavidin molecule linked either to the mRNA serving as the
translation template for the nascent single-chain antibody or
to a cDNA copy thereof; the streptavidin is linked to the mRNA
or cDNA by direct covalent linkage or through noncovalent 
binding to biotin moieties incorporated into the mRNA or cDNA. 
Various additional embodiments are described.

In one embodiment, no tether segment is used; the
nascent single-chain antibody is coupled to the polynucleotide
(e.g., mRNA) by the translating ribosome which links the
nascent single-chain antibody to the polysome complex. In such
embodiments, translation stalling sequences are often
incorporated into the mRNA to produce slowing/stalling of
translation to enhance the stability of polysomes. 

In one variation, the invention also provides a
method of generating nascent peptide or single-chain antibody
libraries comprising the steps of: (1) translating in vitro an
mRNA population wherein individual mRNA molecules individually
encode a nascent polypeptide comprising a tether segment and a
variable peptide segment or single-chain antibody (e.g., scFv)
segment, under translation conditions wherein said tether
segment binds to the encoding template mRNA or a polynucleotide
primer annealed thereto prior to dissociation of the nascent
peptide from the translation complex, thus producing a library
of nascent peptide or single-chain antibody library members,
(2) synthesizing a first-strand cDNA copy of the encoding mRNA
species by reverse transcription primed from an extendable
polynucleotide primer annealed to the template mRNA 3' to the
portion of the mRNA encoding the nascent peptide or single-
chain antibody sequence, optionally hydrolyzing the mRNA
templates, thus producing a library of cDNA-containing nascent
peptide or single-chain antibody library members, (3) screening
the library of nascent peptide or single-chain antibody library
members by contacting the library to an immobilized
macromolecular species under binding conditions and separating
library members bound to the macromolecular species from
unbound library members, (4) selecting either bound or unbound
nascent library members, (5) ligating a suitable promoter and
translation start site, if necessary (e.g., may be contained in
the extendable polynucleotide primer), to the cDNA in the
appropriate orientation to drive transcription of an mRNA
complementary to the first-strand cDNA forming a transcription
template (i.e., DNA template library member), (6) transcribing
mRNA complementary to the first-strand cDNA from the
transcription template, (7) repeating steps (1) through (6)
until the desired level of affinity enrichment for selected
bound (or unbound) nascent peptide or single-chain antibody
library members is attained, and (8) isolating individual cDNA
from the selected library members and determining the
nucleotide sequence(s) of the variable peptide segment(s)
and/or single-chain antibody segment(s) and/or determining the
variable peptide segment, V H , V L , and/or CDR nucleotide
sequence distribution(s) in the selected population by
collectively sequencing the collection of cDNAs represented in
the population of selected library members. In some
variations, steps 4, 5, 6, and/or 7 may be omitted. Generally,
the mRNA population of step (1) is generated by in vitro
transcription of a DNA template library, wherein each DNA
template library member encodes a polypeptide comprising a
tether sequence and a variable peptide sequence or a single-
chain antibody sequence. Each DNA template library member also
comprises an operably linked promoter, especially a promoter
suitable for in vitro transcription and sequences required for
in vitro translation of the transcription product (mRNA), such
as a ribosome binding site.
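
The effect of repeating the selection cycle can be illustrated with a
toy enrichment model. The sketch below is not taken from the
specification: it assumes that, in each round, a binding library member
is recovered with probability p_bind and a non-binding member with
probability p_background, and that amplification between rounds
preserves relative proportions. The function name and the numerical
values are hypothetical.

```python
def binder_fraction_after_rounds(initial_fraction, p_bind, p_background, rounds):
    """Return the fraction of binders in the pool after each selection round.

    initial_fraction: starting fraction of binding library members (0 to 1,
        exclusive)
    p_bind: assumed per-round recovery probability for a binder
    p_background: assumed per-round recovery probability for a non-binder
    rounds: number of selection cycles to simulate
    """
    if not 0.0 < initial_fraction < 1.0:
        raise ValueError("initial_fraction must lie strictly between 0 and 1")
    if p_bind <= 0.0 or p_background <= 0.0:
        raise ValueError("recovery probabilities must be positive")

    fraction = initial_fraction
    history = []
    for _ in range(rounds):
        kept_binders = fraction * p_bind
        kept_background = (1.0 - fraction) * p_background
        # Renormalize: amplification is assumed not to change proportions.
        fraction = kept_binders / (kept_binders + kept_background)
        history.append(fraction)
    return history


# Hypothetical numbers: one binder per million members, 1000-fold
# preferential recovery of binders in each round.
for round_number, fraction in enumerate(
    binder_fraction_after_rounds(1e-6, 0.1, 1e-4, 4), start=1
):
    print(f"round {round_number}: binder fraction = {fraction:.6f}")
```

In this model the odds of drawing a binder are multiplied by
p_bind / p_background in every round, which is why a few cycles can
suffice when background recovery is low.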

The method may also comprise the variation wherein 
the transcription template (s) formed in step (5) (or portion 
thereof encoding the variable segment) or selected library 
members obtained by affinity screening is/are cloned into a 
phagemid expression vector (e.g., pAFF6) so that the encoded 
variable peptide sequence or single-chain antibody polypeptide 
sequence is expressed as a fusion with a bacteriophage coat 



single-chain antibody polypeptide sequences may be used for one
or more subsequent rounds of affinity selection.

In an alternative variation, selected library members
can be cloned or otherwise amplified, followed by additional 
rounds of in vitro translation and selection, avoiding the 
requirement that selected library members encode polypeptide 
sequences which are compatible with bacteriophage coat protein
function and/or which are compatible with functional expression 
in a prokaryotic host cell. In one embodiment, selected 
library members are cloned in a prokaryotic vector (e.g., 
plasmid, phagemid, or bacteriophage) wherein a collection of
individual colonies (or plaques) representing discrete library
members are produced. Individual selected library members can
then be manipulated (e.g., by site-directed mutagenesis, 
cassette mutagenesis, chemical mutagenesis, PCR mutagenesis, 
and the like) to generate a collection of library members
representing a kernal of sequence diversity based on the
sequence of the selected library member. The sequence of an
individual selected library member can be manipulated to 
incorporate random mutation, pseudorandom mutation, defined 
kernal mutation (i.e., comprising variant and invariant residue
positions and/or comprising variant residue positions which can
comprise a residue selected from a defined subset of amino acid
residues), and the like, either segmentally or over the entire
length of the individual selected library member sequence. 

The method may also comprise the variation that the
individual library members may be directly sequenced
individually (i.e., not collectively) by diluting the pool of
affinity-selected library members such that about 1 library
member cDNA is represented in each separate reaction vessel
(e.g., microtitre well). Each cDNA is then amplified by PCR
and sequenced.
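
The consequence of diluting to about 1 cDNA per vessel can be estimated
with a short calculation. The sketch below assumes that the number of
cDNA molecules per well follows a Poisson distribution, which is a
common modelling assumption for limiting dilution and is not stated in
the text; the function name well_occupancy is hypothetical.

```python
import math


def well_occupancy(mean_per_well):
    """Return Poisson probabilities for a well receiving 0, 1, or >1 cDNAs.

    mean_per_well: average number of library member cDNAs per well
    (about 1 in the variation described above).
    """
    if mean_per_well <= 0:
        raise ValueError("mean_per_well must be positive")
    empty = math.exp(-mean_per_well)
    single = mean_per_well * math.exp(-mean_per_well)
    multiple = 1.0 - empty - single
    return {
        "empty": empty,
        "single": single,
        "multiple": multiple,
        # Share of non-empty wells that hold exactly one template.
        "clonal_among_occupied": single / (1.0 - empty),
    }


for mean in (0.3, 1.0, 2.0):
    occupancy = well_occupancy(mean)
    print(mean, {name: round(value, 3) for name, value in occupancy.items()})
```

Under this assumption, lowering the mean below 1 raises the share of
occupied wells that contain a single template, at the cost of more
empty wells.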

The invention also provides compositions comprising 
individual library members that comprise a nascent polypeptide 
comprising a first polypeptide portion linked to a
variable amino acid segment or (2) a single-chain antibody, in
peptide linkage to said first polypeptide portion. In one
aspect of the invention, the individual library members lack
bound ribosomes, for example lacking ribosomes bound to an mRNA
in a translation complex (e.g., polysome). 

The invention also provides compositions comprising a 
nascent single-chain antibody polysome library which consists 
of a population of library members wherein essentially each 
library member comprises a single-chain antibody bound as a 
nascent polypeptide in a polysome. Typically, such libraries 
substantially lack library members encoding nascent 
polypeptides that do not comprise at least 15 contiguous amino 
acids of a naturally-occurring immunoglobulin sequence,
preferably a human immunoglobulin (e.g., human V H or V L ) 
sequence. Such library members may comprise a tether segment, 
a translation stall segment, both, or neither of these. 

The invention also provides peptide libraries 
comprising a plurality of individual library members of the 
invention, wherein (1) each individual library member of said 
plurality comprises a tether segment sequence which is 
substantially identical to the tether segment sequences of the 
remainder of individual library members in said plurality, and 
(2) each individual library member comprises a variable peptide 
segment sequence or single-chain antibody segment sequence 
which is distinct from the variable peptide segment sequences 
or single-chain antibody sequences of other individual library 
members in said plurality (although some library members may be 
present in more than one copy per library due to uneven 
amplification, stochastic probability, or the like).

The invention also provides novel compositions 
comprising at least one library member, said library member 
comprising an mRNA molecule, or cDNA copy thereof, linked with
the nascent variable peptide segment or nascent single-chain 
antibody encoded by said mRNA, wherein the linkage of the mRNA 
or cDNA to the nascent peptide is by noncovalent binding to the 

The invention also provides a product-by-process, 
wherein antibodies having a predetermined binding specificity
are formed by the process of: (1) screening a nascent single-
chain antibody polysome library against a predetermined epitope 
(e.g., antigen macromolecule) and identifying and/or enriching 
library members which bind to the predetermined epitope, and 
(2) expressing in a cell a single-chain antibody encoded by a
library member (or copy thereof) which binds the predetermined 
epitope and has been thereby isolated and/or enriched from the 
library. 



BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1. Construction of a synthetic gene for expressing the D32.39
epitope or control, non-binding, peptides in vitro. Partial
restriction map of the bacteriophage T7 promoter expression
plasmid, pT7-7. The figure shows the nucleotide sequence and
predicted amino acid sequence of the D32.39 epitope fusion
protein after linearizing plasmid pLM138 with HindIII.
Nucleotides are numbered on the right; amino acids are numbered
on the left. The gene was constructed by annealing synthetic
oligonucleotides to their complementary strands to generate
double stranded cassettes flanked by the indicated restriction
sites. Individual cassettes were cleaved by the appropriate
restriction enzymes and subcloned sequentially to pT7-7
starting with the SalI/HindIII cassette, and followed by the
BamHI/SalI and EcoRI/BamHI cassettes. The NdeI/EcoRI cassettes
were subcloned last and contained either the D32.39 epitope
sequence shown or the control sequence, 5'
CATATG GCTGTTTTCAAACGTACCGTTCAG GAATTC 3' (NdeI and EcoRI sites
are underlined).
Figure 2. Specific binding of polysomes to mAb
D32.39. Radiolabelled polysomes were isolated from reactions
programmed with 1.5 µg of HindIII-linearized plasmid pLM138 or
pLM142 and bound to microtiter wells containing the immobilized



Competition binding assay. Microtiter wells were preincubated
with polysome buffer in the absence or presence of 10
dynorphin B peptide for 1 hr at 4°C prior to adding 131,000
cpm of polysomes containing the D32.39 epitope (RQFKWT) or 
control (VFKRTVQ) sequences. 

Figure 3. Construction of a DNA library containing a
random population of decacodon sequences. Panel (a): The
nucleotide sequence of the degenerate region is shown on the
left with the numbers indicating the nucleotide positions. The
degenerate region was constructed by annealing 100 pmoles each
of oligonucleotides ON1543 (positions 1-90) and ON1747
(complementary to positions 74-146) and extending in a reaction
containing 104 units Sequenase (US Biochemical)/1 mM dNTP/10 mM
DTT for 30 min at 37°C. The extended product was cleaved with
BstXI, ethanol precipitated, and resuspended in water. The
Gly-Ser coding region of plasmid pLM142 was modified by
inserting noncomplementary BstXI site linkers between the
HindIII/ClaI sites and NdeI/EcoRI sites resulting in plasmid
pLM144. Plasmid pLM144 was cleaved with BstXI and the 277 bp
fragment containing the Gly-Ser coding region shown on the
right was gel purified, quantitated, and 4 µg were ligated to
an equivalent amount of the degenerate region in a reaction
containing 400 units T4 ligase/50 mM Tris-Cl pH 8/10 mM
MgCl2/10 mM DTT/1 mM ATP/25 µg/ml BSA for 16 hrs at 15°C. The
323 bp ligated product was gel purified and quantitated. The
overlined sequences indicate the T7 promoter, gene 10 ribosome
binding site (SD) and the initiator methionine (ATG). Panel
(b): A schematic overview of the procedure used to produce the
library members.

Figure 4. Subcloning of the DNA pool to the phagemid
vector, pAFF6, for sequencing and ELISA. Approximately 25 ng
of DNA was cleaved with NheI/KpnI before and after each round
of affinity selection and ligated to the same sites of pAFF6
resulting in translational fusions of library peptides to the
pIII capsid protein of M13 (C. Wagstrom, personal
of VCSM13 helper phage to isolate recombinant phage as
previously described ( ).

Figure 5. The effect of DNA library concentration on
protein synthesis in vitro. The incorporation of
[35S]methionine into protein was measured as described in the
Materials and Methods.

Figure 6. Amino acid alignment of selected peptide 
sequences with dynorphin B. The six-residue D32.39 epitope 
sequence of dynorphin B and the peptide regions similar to it 
are shown in the box. A total of 6, 13, 19, and 9 independent 
clones were sequenced from rounds 2, 3, 4 and 5, respectively. 
The frequency indicates the number of times each sequence 
occurred among the clones isolated from each round, and the 
asterisks indicate identical sequences found in different 
rounds. Binding affinities for D32.39 were determined by 
chemically synthesizing the indicated peptide sequences and 
measuring the IC50 as described in the Experimental Examples.

Figure 7. Schematic maps of plasmids pLM169,
pLM166, and pLM153.

Figure 8. Determination of soluble antibody binding
by ELISA.

Figure 9. Polysome isolation and binding of 
antibodies displayed on polysomes. 

Figure 10. Schematic overview of a representative 
nascent peptide display method of the invention. The defined 
sequence kernal (NNK)n represents the variable peptide portion
of the nascent polypeptide. Step 7 represents the recovery
and/or identification of the variable peptide portion(s) of
selected library members, and may be performed after any number
of cycles of the basic scheme (steps 1-6).

Figure 11. Schematic overview of construction of a 
scFv display library by PCR overlap. Sequences of the 
oligonucleotides ON3149, ON3150, ON3147, ON3148, ON3193, and 
ON2970 are shown hereinbelow.



DEFINITIONS 



Unless defined otherwise, all technical and scientific terms
used herein have the same meaning as commonly
understood by one of ordinary skill in the art to which this
invention belongs. Any methods and materials similar or
equivalent to those described herein can be used in the
practice or testing of the present invention, but the currently
preferred methods and materials are described herein. For
purposes of the present invention, the following terms are
defined below.



The term "naturally-occurring" as used herein as
applied to an object refers to the fact that an object can be
found in nature. For example, a polypeptide or polynucleotide
sequence that is present in an organism (including viruses)
that can be isolated from a source in nature and which has not

and their abbreviations follow conventional usage
(Biochemistry, Third Edition (1988), Lubert Stryer, ed., W.H.
Freeman and Company, NY, which is incorporated herein by
reference). Stereoisomers (e.g., D-amino acids) of the twenty
conventional amino acids, unnatural amino acids such as
α,α-disubstituted amino acids, N-alkyl amino acids, lactic acid,
and other unconventional amino acids and analogs may also be
suitable components for polypeptides of the present invention.
Examples of unconventional amino acids include:
4-hydroxyproline, γ-carboxyglutamate, ε-N,N,N-trimethyllysine,
ε-N-acetyllysine, O-phosphoserine, N-acetylserine,
N-formylmethionine, 3-methylhistidine, 5-hydroxylysine,
ω-N-methylarginine, and other similar amino acids and imino acids
(e.g., 4-hydroxyproline). Unconventional and unnatural amino
acids may be incorporated into in vitro translation products if
incorporated into an aminoacyl-tRNA that can participate in
ribosome-mediated peptide elongation.

As used herein, the term "nascent peptide" refers to
a polypeptide produced by ribosome-mediated translation of a
template mRNA, and wherein the polypeptide is associated with

template mRNA but can also include partially translated or
prematurely terminated products. A "nascent single-chain
antibody" is a nascent polypeptide which comprises a single-
chain antibody.

As used herein, the term "single-chain antibody" 
refers to a polypeptide comprising a V H domain and a V L domain 
in polypeptide linkage, generally linked via a spacer peptide 
(e.g., [Gly-Gly-Gly-Gly-Ser]x), and which may comprise
additional amino acid sequences at the amino- and/or carboxy- 
termini. For example, a single-chain antibody may comprise a 
tether segment for linking to the encoding polynucleotide. As 
an example, a scFv is a single-chain antibody. Single-chain 
antibodies are generally proteins consisting of one or more 
polypeptide segments of at least 10 contiguous amino acids 
substantially encoded by genes of the immunoglobulin 
superfamily (e.g., see The Immunoglobulin Gene Superfamily,
A.F. Williams and A.N. Barclay, in Immunoglobulin Genes, T.
Honjo, F.W. Alt, and T.H. Rabbitts, eds., (1989) Academic
Press: San Diego, CA, pp. 361-387, which is incorporated herein
by reference), most frequently encoded by a rodent, non-human
primate, avian, porcine, bovine, ovine, goat, or human heavy 
chain or light chain gene sequence. A functional single-chain 
antibody generally contains a sufficient portion of an 
immunoglobulin superfamily gene product so as to retain the 
property of binding to a specific target molecule, typically a 
receptor or antigen (epitope).

As used herein, the term "complementarity-determining 
region" and "CDR" refer to the art-recognized term as 
exemplified by the Kabat and Chothia CDR definitions also 
generally known as hypervariable regions or hypervariable loops 
(Chothia and Lesk (1987) J. Mol. Biol. 196: 901; Chothia et al.
(1989) Nature 342: 877; E.A. Kabat et al., Sequences of
Proteins of Immunological Interest (National Institutes of
Health, Bethesda, MD) (1987); and Tramontano et al. (1990) J.
Mol. Biol. 215: 175). Variable region domains typically



longer are also suitable for forming single-chain antibodies.

An immunoglobulin light or heavy chain variable
region consists of a "framework" region interrupted by three 
hypervariable regions, also called CDR's. The extent of the
framework region and CDR's has been precisely defined (see,
"Sequences of Proteins of Immunological Interest," E. Kabat et
al., 4th Ed., U.S. Department of Health and Human Services,
Bethesda, MD (1987)). The sequences of the framework regions 
of different light or heavy chains are relatively conserved 
within a species. As used herein, a "human framework region" 
is a framework region that is substantially identical (about
85% or more, usually 90-95% or more) to the framework region of
a naturally occurring human immunoglobulin. The framework 
region of an antibody, that is the combined framework regions 
of the constituent light and heavy chains, serves to position 
and align the CDR's. The CDR's are primarily responsible for 
binding to an epitope of an antigen. 
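
The identity criterion for a "human framework region" can be
illustrated with a simple calculation. The sketch below assumes the
two framework sequences have already been aligned to equal length
without gaps, and it applies the "about 85% or more" figure as a hard
cutoff; both are simplifications made for the example. The function
names and the two short sequences are hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percent of positions at which two pre-aligned sequences agree."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


def is_human_framework(candidate, human_reference, threshold=85.0):
    """Apply the 'about 85% or more' criterion as a hard cutoff."""
    return percent_identity(candidate, human_reference) >= threshold


# Hypothetical 20-residue framework stretches differing at two positions.
human_reference = "EVQLVESGGGLVQPGGSLRL"
candidate = "QVQLVESGGGLVKPGGSLRL"
print(percent_identity(candidate, human_reference))
print(is_human_framework(candidate, human_reference))
```

Real comparisons would normally align full framework regions first
(for example under the Kabat numbering cited above) before counting
identities.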

As used herein, the term "tether segment" refers to a 
portion of a nascent peptide or nascent antibody which binds to 
the encoding mRNA molecule serving as the translation template 
for the synthesis of the nascent polypeptide, or to a cDNA copy 
of such encoding mRNA, either directly or through binding an 
intermediate molecule that is linked directly to the encoding 
mRNA or cDNA copy thereof. 

As used herein, the term "variable segment" refers to 
a portion of a nascent peptide which comprises a random, 
pseudorandom, or defined kernal sequence. A variable segment 
can comprise both variant and invariant residue positions, and 
the degree of residue variation at a variant residue position 
may be limited; both options are selected at the discretion of 
the practitioner. Typically, variable segments are about 5 to 
20 amino acid residues in length (e.g., 8 to 10), although 
variable segments may be longer and may comprise antibody 
portions or receptor proteins, such as an antibody fragment, a 

an amino acid sequence composed of two or more amino acid
monomers and constructed by a stochastic or random process. A
random peptide can include framework or scaffolding motifs,
which may comprise invariant sequences. 

As used herein "random peptide library" refers to a 
set of polynucleotide sequences that encodes a set of random 
peptides, and to the set of random peptides encoded by those 
polynucleotide sequences, as well as the fusion proteins 
containing those random peptides. 

As used herein, the term "pseudorandom" refers to a 
set of sequences that have limited variability, so that for 
example the degree of residue variability at one position is 
different than the degree of residue variability at another 
position, but any pseudorandom position is allowed some degree 
of residue variation, however circumscribed. 

As used herein, the term "defined sequence framework" 
refers to a set of defined sequences that are selected on a 
nonrandom basis, generally on the basis of experimental data or 
structural data; for example, a defined sequence framework may 
comprise a set of amino acid sequences that are predicted to 
form a β-sheet structure or may comprise a leucine zipper
heptad repeat motif, a zinc-finger domain, among other 
variations. A "defined sequence kernal" is a set of sequences 
which encompass a limited scope of variability. Whereas (1) a 
completely random 10-mer sequence of the 20 conventional amino
acids can be any of (20)^10 sequences, and (2) a pseudorandom
10-mer sequence of the 20 conventional amino acids can be any
of (20)^10 sequences but will exhibit a bias for certain
residues at certain positions and/or overall, (3) a defined
sequence kernal is a subset of sequences which is less than the
maximum number of potential sequences if each residue position
were allowed to be any of the allowable 20 conventional amino
acids (and/or allowable unconventional amino/imino acids). A
defined sequence kernal generally comprises variant and
invariant residue positions and/or comprises variant residue
positions which can comprise a residue selected from a defined



selected library member sequence. Defined sequence kernals can
refer to either amino acid sequences or polynucleotide
sequences. For illustration and not limitation, the sequences
(NNK)10 and (NNM)10, where N represents A, T, G, or C; K
represents G or T; and M represents A or C, are defined 
sequence kernals. 
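
The coding capacity of the NNK and NNM codon schemes can be worked out
by enumeration. The sketch below uses the standard genetic code and
the base definitions given above (N = A, T, G, or C; K = G or T;
M = A or C); the helper names expand_codons and summarize are
hypothetical and chosen only for this example.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering; "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Degenerate base symbols as defined in the text.
DEGENERATE = {"N": "ATGC", "K": "GT", "M": "AC"}


def expand_codons(pattern):
    """List every concrete codon matched by a 3-letter degenerate pattern."""
    choices = [DEGENERATE.get(symbol, symbol) for symbol in pattern]
    return ["".join(bases) for bases in product(*choices)]


def summarize(pattern):
    """Count the codons, distinct amino acids, and stop codons of a pattern."""
    codons = expand_codons(pattern)
    translated = [CODON_TABLE[codon] for codon in codons]
    return {
        "codons": len(codons),
        "amino_acids": len(set(translated) - {"*"}),
        "stop_codons": translated.count("*"),
    }


for pattern in ("NNK", "NNM"):
    print(pattern, summarize(pattern))

# Number of distinct DNA sequences in a (NNK)10 library, and the number
# of distinct 10-mer peptides over the 20 conventional amino acids.
print(len(expand_codons("NNK")) ** 10, 20 ** 10)
```

The enumeration shows why NNK is often chosen for random peptide
libraries: it reaches all 20 conventional amino acids while admitting
only one stop codon, whereas NNM omits methionine and tryptophan and
admits two.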

As used herein "RNA binding protein" refers to a 
protein that specifically interacts with a polyribonucleotide 
strand or strands. Those of skill in the art will recognize 
that, for purposes of the present invention, the RNA binding 
protein must bind specifically to the template mRNA, for 
example the RNA binding protein may bind to a specific sequence
of the mRNA which will suppress reinitiation of new translation
from the template mRNA. In embodiments of the invention in 
which DNA binding polypeptides are used, DNA binding proteins 
are typically those proteins which bind to DNA, in a sequence- 
specific or sequence-insensitive manner (e.g., helix-loop- 
helix, zinc finger, homeodomain, histone, etc.). 

In some embodiments, DNA-binding proteins can bind to 
DNA in a sequence-specific manner (e.g., bind to specific 
predetermined nucleotide sequences); in such embodiments, the 
nascent polypeptide library members comprise an encoding 
polynucleotide (or DNA primer bound thereto) which comprises a 
sequence bound by the sequence specific DNA-binding protein. 
As used herein, the term "polynucleotide-binding protein" 
encompasses RNA-binding proteins and DNA-binding proteins, 
whether sequence-specific or sequence-insensitive. 

As used herein "epitope" refers to that portion of an 
antigen or other macromolecule capable of forming a binding 
interaction that interacts with the variable region binding 
pocket of an antibody. Typically, such binding interaction is 
manifested as an intermolecular contact with one or more amino 
acid residues of a CDR. 

As used herein, "receptor" refers to a molecule that
has an affinity for a given ligand. Receptors can be naturally

Receptors can be attached, covalently or noncovalently, to a
binding member, either directly or via a specific binding
substance. Examples of receptors include, but are not limited
to, antibodies, including monoclonal antibodies and antisera
reactive with specific antigenic determinants (such as on
viruses, cells, or other materials), cell membrane receptors,
complex carbohydrates and glycoproteins, enzymes, and hormone
receptors.

As used herein "ligand" refers to a molecule, such as 
a random peptide or variable segment sequence, that is 
recognized by a particular receptor. As one of skill in the 
art will recognize, a molecule (or macromolecular complex) can
be both a receptor and a ligand. In general, the binding
partner having a smaller molecular weight is referred to as the
ligand and the binding partner having a greater molecular 
weight is referred to as a receptor. 

As used herein, "linker" or "spacer" refers to a
molecule or group of molecules that connects two molecules,
such as a DNA binding protein and a random peptide, and serves
to place the two molecules in a preferred configuration, e.g., 
so that the random peptide can bind to a receptor with minimal 
steric hindrance from the DNA binding protein. 

As used herein, the term "operably linked" refers to
a linkage of polynucleotide elements in a functional
relationship. A nucleic acid is "operably linked" when it is
placed into a functional relationship with another nucleic acid
sequence. For instance, a promoter or enhancer is operably
linked to a coding sequence if it affects the transcription of
the coding sequence. Operably linked means that the DNA 
sequences being linked are typically contiguous and, where 
necessary to join two protein coding regions, contiguous and in 
reading frame. 

As used herein, "glycosylating cell" is a cell
capable of glycosylating proteins, particularly eukaryotic
cells capable of adding an N-linked "core oligosaccharide"
containing at least one mannose residue and/or capable of

contains at least one enzymatic activity that catalyzes the
attachment of a sugar residue to a glycosylating site sequence
in a protein or polypeptide, and the cell actually glycosylates
at least one expressed polypeptide. For example but not for
limitation, mammalian cells are typically glycosylating cells.
Other eukaryotic cells, such as insect cells and yeast, may be
glycosylating cells.

DETAILED DESCRIPTION 

Generally, the nomenclature used hereafter and many
of the laboratory procedures in cell culture, molecular
genetics, and nucleic acid chemistry and hybridization
described below are those well known and commonly employed in
the art. Standard techniques are used for recombinant nucleic
acid methods, polynucleotide synthesis, in vitro polypeptide
synthesis, and the like and microbial culture and
transformation (e.g., electroporation). Generally enzymatic
reactions and purification steps are performed according to the
manufacturer's specifications. The techniques and procedures
are generally performed according to conventional methods in
the art and various general references (see, generally,
Sambrook et al. Molecular Cloning: A Laboratory Manual, 2d ed.
(1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor,
N.Y.; and Antibodies: A Laboratory Manual, (1988) E. Harlow and
D. Lane, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY,
each of which is incorporated herein by reference) which are
provided throughout this document. The procedures therein are
believed to be well known in the art and are provided for the
convenience of the reader. All the information contained
therein is incorporated herein by reference.

Oligonucleotides can be synthesized on an Applied Biosystems
oligonucleotide synthesizer according to specifications
provided by the manufacturer. 

Methods for PCR amplification are described in the 
art (PCR Technology: Principles and Applications for DNA



eds. Innis, Gelfand, Sninsky, and White, Academic Press, San
Diego, CA (1990); Mattila et al. (1991) Nucleic Acids Res. 19:
4967; Eckert, K.A. and Kunkel, T.A. (1991) PCR Methods and
Applications 1: 17; and U.S. Patent Nos. 4,683,202 and
4,965,188, each of which is incorporated herein by reference)
and exemplified hereinbelow.

Overview 

The present invention provides novel compositions and 
methods for screening in vitro polysome libraries displaying 
nascent peptides comprising a random, pseudorandom, or defined 
sequence polypeptide framework. In an aspect of the invention, 
polysome libraries display single-chain antibodies comprising a 
V H domain, V L domain, and spacer peptide.

Generally, a single-chain expression polynucleotide 
is generated. This expression polynucleotide contains: (1) a 
single-chain antibody cassette consisting of a V H domain, 
spacer peptide, and V L domain operably linked to encode a 
single-chain antibody, (2) a promoter suitable for in vitro 
transcription (e.g., T7 promoter, SP6 promoter, and the like) 
operably linked to ensure in vitro transcription of the single- 
chain antibody cassette forming a mRNA encoding a single-chain 
antibody, and (3) a transcription termination sequence suitable 
for functioning in an in vitro transcription reaction.
Optionally, the expression polynucleotide may also comprise an 
origin of replication and/or a selectable marker. An example 
of a suitable expression polynucleotide is pLM166 (see, EXAMPLE
2).

The V H and V L sequences can be conveniently obtained
from a library of V H and V L sequences produced by PCR
amplification using V gene family-specific primers or V
gene-specific primers (Nicholls et al. (1993) J. Immunol. Meth.
165: 81; WO93/12227) or are designed according to standard
art-known methods based on available sequence information.
Typically, mouse or human V H and V L sequences are isolated. The
V H and V L sequences are then ligated, usually with an
intervening spacer sequence. A library comprising a plurality of
V H and V L sequences is used (sometimes also with a plurality of
spacer peptide species represented). Frequently, a library is
constructed wherein one or more of the V H and V L sequences are
mutated to increase sequence diversity, particularly at CDR
residues, sometimes at framework residues. V region sequences
can be conveniently cloned as cDNAs or PCR amplification
products from immunoglobulin-expressing cells. For example,
cells from a human hybridoma, lymphoma, or other cell line that
synthesizes either cell surface or secreted immunoglobulin are
used for the isolation of polyA+ RNA. The RNA is then used for
the synthesis of oligo dT primed cDNA using the enzyme reverse
transcriptase (for general methods see Goodspeed et al. (1989)
Gene 76: 1; Dunn et al. (1989) J. Biol. Chem. 264: 13057).
Once the V-region cDNA or PCR product is isolated, it is cloned
into a vector to form a single-chain antibody cassette. For
example and not limitation, the CANTAB vector system (sold
commercially by Pharmacia Biotech, Alameda, CA) and its
variants are suitable for cloning V H and V L sequences by PCR
amplification. The phagemid pSEx (Dubel et al. (1993) Gene
128: 97) and similar vectors are suitable for surface display
of scFv on bacteriophage.

In one aspect, the present invention provides an
improved method, using an in vitro translation system for
translating mRNA to form polysomes displaying nascent peptides,
including nascent single-chain antibodies, which in one
variation are scFv. This aspect of the invention comprises
using an E. coli S30 translation system (Promega, Madison,
Wisconsin) for efficient in vitro translation. The E. coli S30
translation system provides advantageous high efficiency
translation of a variety of mRNA templates, as compared to
other in vitro translation systems (e.g., wheat germ extract,
rabbit reticulocyte lysate). Furthermore, the E. coli S30
system can provide a coupled transcription/translation system
which is generally more convenient to use and more efficient
than an uncoupled system. In addition, the S30 system for in
vitro translation permits the construction of very large
libraries via the methods of the invention. Thus, while the
invention is typically practiced with reaction volumes of 50
microlitres to 5 mL, one can also prepare libraries in reaction
volumes of 5 mL to 50 mL (or even larger) by the present
methods. The S30 system is also amenable to the incorporation
of unnatural amino acids using tRNA molecules charged with
unnatural amino acids. See PCT patent publication No. 90/05785,
incorporated herein by reference.

In another aspect, the present invention provides 
improved binding and/or washing conditions for screening 
polysome peptide-display libraries and single-chain antibody 
display libraries. In general, this improvement comprises: 

(1) isolating polysomes from an in vitro translation reaction 
by ultracentrifugation prior to screening the recovered 
polysomes for high-affinity binding to a receptor or epitope, 
and optionally the pellet containing the
centrifugation-purified polysomes is resolubilized in a suitable
buffer (i.e., one that does not disrupt intact polysomes) and
centrifuged a second
time to further purify the polysome population prior to 
affinity screening with receptor or epitope, and/or 

(2) reducing non-specific binding of nascent peptide-displaying 
polysomes or nascent single-chain antibody-displaying polysomes 
by contacting a preblocking agent (e.g., nonfat milk, casein, 
bovine serum albumin, gelatin, tRNA) to the immobilized 
receptor or epitope prior to affinity screening. A non-ionic 
detergent may optionally also be added. 

In another aspect, the invention provides a method 
for generating nascent peptide-polysome libraries or nascent 
single-chain antibody-polysome libraries by coupled in vitro 
transcription/translation using an E. coli S30 system. This
improvement avoids the bacteriophage-display method which 
requires replication and/or transcription of the DNA templates 
in a cell, which may reduce the diversity of the library and/or 
skew the distribution of the relative abundances of individual 
library members. Moreover, the coupled E. coli system is
highly efficient and the library size is not limited by the

In another aspect, the invention provides an 
improvement to the general method of screening nascent
peptide-polysome libraries. This improvement can be used in
conjunction with single-chain antibody polysome libraries. The 
improvement comprises the step of taking DNA sequences produced 
from positive nascent peptide-polysomes (or single-chain 
antibody polysomes) obtained after one or more rounds of 
affinity screening and performing one or more additional rounds 
of affinity screening by a different screening method, such as 
by expression of the selected DNA sequence(s) in a
bacteriophage coat protein display system, by expression as a 
soluble antibody in a prokaryotic or eukaryotic expression 
system, or by various methods for in vitro expression. For 
example, expression of scFv in eukaryotic expression systems, 
particularly in glycosylating cells, has the benefit of 
avoiding potential aggregation and misfolding of the scFv which 
may occur in some prokaryotic-based expression systems, as well 
as producing a glycosylated scFv, if said scFv contains 
suitable glycosylating site sequence(s).

For example, bacteriophage antibody display libraries 
can be created from selected sequences by subcloning the 
positive (i.e., selected) DNA sequence(s) into a phagemid 
vector (e.g., pAFF6) wherein the subcloned DNA is expressed as 
a fusion with a bacteriophage coat protein (e.g., pIII or
pVIII) in the same reading frame as the nascent peptides (or 
single-chain antibodies) of the positive polysomes. The 
phagemid is propagated to produce bacteriophage particles 
displaying the nascent peptide sequence (or single-chain 
antibody) as a fusion with a phage coat protein. This 
improvement also relates to subcloning the nucleic acids 
encoding positive peptide-polysomes into other selection 
systems, such as the peptides on plasmids (using, e.g., lac as 
the DNA binding protein) or the maltose binding protein systems 
discussed above. 

The peptide-displaying phage (or other, depending on
the selection system) can then be screened by affinity
chromatography, and the like, using an immobilized receptor or
epitope (PCT Publication Nos. 91/17271, 91/18980, and
93/08278). Thus, in some embodiments, the phage (or phagemid)



WO 95/11922 PCT/US94/12206


particle is used in an ELISA to determine the specificity of
peptide binding. The availability of such assays and selection
methods for the phage (or other) selection systems allows other
advantages to be realized from the improved polysome display
method of the present invention. In one embodiment, the
variable region of nucleic acid that is expressed by the
polysome and tested for receptor binding is a concatemer of
short (i.e., 6 to 20 amino acids in length) peptide coding
sequences optionally linked through nucleotides that are a
restriction enzyme recognition site. After selection, the
concatemer is cleaved with the restriction enzyme and the
fragments (encoding the individual peptides) are cloned into
the secondary selection system (i.e., the peptides on phage
system), where a single panning cycle (binding of peptide to
receptor and washing away unbound peptides) will serve to
enrich the library with the peptide sequences from the
concatemer that encode the ligands of interest. One could also
use the process of concatemerization to combine and sequence
together a number of individual peptide encoding sequences from
a pool of positive peptide-polysomes.
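
By way of illustration only, the cleavage of a selected concatemer
into its individual peptide-coding fragments can be sketched in
Python as follows. The sketch assumes, as a simplification, that the
restriction site acts as a pure delimiter between coding units (a
real enzyme cuts within its recognition site and leaves site-derived
nucleotides on each fragment). The function name split_concatemer,
the choice of the BamHI recognition sequence GGATCC, and the two
18-nucleotide coding units are hypothetical and are not taken from
the Examples.

```python
def split_concatemer(concatemer: str, site: str) -> list[str]:
    """Split a DNA concatemer into coding units at each occurrence of `site`.

    Simplifying assumption: the recognition site is treated as a plain
    delimiter and is discarded; overhangs left by a real enzyme are ignored.
    """
    fragments = concatemer.upper().split(site.upper())
    # Drop empty strings produced by leading, trailing, or adjacent sites.
    return [fragment for fragment in fragments if fragment]


# Hypothetical example: two 18-nt units (6 codons each) joined by a BamHI site.
BAMHI_SITE = "GGATCC"
units = ["GCTGCTGTTCCGGCTGCT", "TGGCATCGTAAACTGTTC"]
concatemer = BAMHI_SITE.join(units)

recovered = split_concatemer(concatemer, BAMHI_SITE)
print(recovered)
```

Each recovered fragment would then be cloned individually into the
secondary selection system; a practical implementation would also
need to check that no coding unit itself contains the chosen site.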

In one embodiment, the single-chain antibody-encoding
portion of the polynucleotide that is expressed by the polysome
and tested for epitope binding encodes a V H and V L which are
flanked by convenient restriction sites to facilitate the
excision of the V H sequence, V L sequence, or both. After
selection, the site(s) is/are cleaved with the restriction
enzyme(s) and the fragments (encoding the individual domains or
entire scFv) are cloned into a secondary selection system
(e.g., antibody bacteriophage display system), where a single
panning cycle (binding of single-chain antibody to epitope and
washing away unbound single-chain antibodies) will serve to
enrich the library for members that encode the single-chain
antibodies of interest.

With regard to methods of generating peptide-polysome
libraries, one can use a coupled in vitro
transcription/translation system (e.g., E. coli S30) to produce
a very large library of nascent peptide-polysomes (or single-chain
antibody-polysomes) which is initially screened for ligand-binding
species, epitope-binding species, or receptor-binding
species. In a coupled system, DNA encoding the library is
added to the extract for performing transcription and
translation. Of course, one can also use an uncoupled system,
producing the RNA in one reaction and then adding that RNA to
an in vitro translation system. After production, screening
and selection, the positive isolates (e.g., enriched pools of
positive isolates) are then transferred into a bacteriophage
display system that may be screened further for receptor or
epitope binding species using a variety of assays (such as the
ELISA noted above) and screening conditions, including assays
and selection steps that might not be compatible with intact
polysomes. Moreover, once positive sequences have been
inserted into a bacteriophage peptide-display vector (e.g.,
pAFF6), they may be conveniently mutagenized (e.g., with
mutagenic PCR and/or site-directed oligonucleotide mutagenesis
(e.g., in M13) and/or chemical mutagenesis) for producing
advantageous sequence variants. Thus, single-chain antibodies
which are isolated after an initial round (or multiple rounds,
which may include display on phage, expression as a soluble
scFv in a prokaryotic or eukaryotic cell, or in vitro
expression, in any order) of screening can be cloned into a
bacteriophage antibody-display vector and can be mutagenized
further, typically by limited sequence diversification in or
near one or more of the CDRs, to effectively mirror the in vivo
process known as "affinity sharpening". The diversified
antibody library can then be screened according to conventional
bacteriophage antibody-display methods. Alternatively,
single-chain antibodies which are isolated after an initial round
(or multiple rounds) of screening can be retained in a
polysome-display vector and can be mutagenized further; the
diversified single-chain antibody-polysome library can be screened



The present invention also provides random,
pseudorandom, and defined sequence framework peptide libraries
and methods for generating and screening those libraries to
identify useful compounds (e.g., peptides, including
single-chain antibodies) that bind to receptor molecules or epitopes
of interest or gene products that modify peptides or RNA in a 
desired fashion. The random, pseudorandom, and defined 
sequence framework peptides are produced from libraries of 
nascent peptide library members that comprise nascent peptides 
or nascent single-chain antibodies attached to an mRNA template 
from which the nascent peptide was synthesized by in vitro 
translation, or attached to a DNA primer hybridized to the mRNA 
or to a cDNA copy of the mRNA template. The mode of attachment 
may vary according to the specific embodiment of the invention 
selected. 

A method of affinity enrichment allows a very large 
library of peptides and single-chain antibodies to be screened 
and the polynucleotide sequence encoding the desired peptide(s)
or single-chain antibodies to be selected. The polynucleotide
can then be isolated and sequenced to deduce the amino acid
sequence of the selected peptide(s) or single-chain antibodies
(or just V H, V L, or CDR portions thereof). Using these
methods, one can identify a peptide or single-chain antibody as
having a desired binding affinity for a molecule. The peptide
or antibody can then be synthesized in bulk by conventional
means.
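
As an illustrative sketch only, deducing the amino acid sequence
from the sequenced polynucleotide amounts to reading the coding
strand codon by codon using the standard genetic code. The function
name translate, the assumption that the supplied sequence begins in
frame, and the example sequence are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(coding_sequence: str) -> str:
    """Translate an in-frame DNA (or RNA) coding sequence to one-letter amino acids.

    Translation stops at the first stop codon; trailing bases that do not
    fill a complete codon are ignored.
    """
    dna = coding_sequence.upper().replace("U", "T")
    peptide = []
    for start in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[start:start + 3]]
        if amino_acid == "*":
            break
        peptide.append(amino_acid)
    return "".join(peptide)


# Hypothetical selected insert: Met-Ala-Ala-Val-Pro followed by a stop codon.
print(translate("ATGGCTGCTGTTCCGTAA"))  # MAAVP
```

The sketch does not handle ambiguous bases or alternative reading
frames; a sequence recovered from a real clone would first be
trimmed to the known start of the variable segment.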

A significant advantage of the present invention is 
that no prior information regarding an expected ligand 
structure is required to isolate peptide ligands or antibodies 
of interest. The peptide identified can have biological 
activity, which is meant to include at least specific binding 
affinity for a selected receptor molecule and, in some 
instances, will further include the ability to block the 
binding of other compounds, to stimulate or inhibit metabolic 
pathways, to act as a signal or messenger, to stimulate or 
inhibit cellular activity, and the like. 



Polysome libraries displaying nascent peptides can be
generated by a variety of methods. Generally, an in vitro
translation system is employed to generate polysomes from a
population of added mRNA species. Often, the in vitro
translation system used is a conventional eukaryotic 
translation system (e.g., rabbit reticulocyte lysate, wheat 
germ extract). However, an E. coli S30 system (Promega,
Madison, Wisconsin) can be used to generate the polysome
library from a population of added mRNA species or by coupled
transcription/translation (infra). Suitable E. coli S30
systems may be produced by conventional methods or may be
obtained from commercial sources (Promega, Madison, Wisconsin).
The E. coli S30 translation system is generally more efficient
at producing polysomes suitable for affinity screening of
displayed nascent peptides, and the like. Moreover, a
prokaryotic translation system, such as the E. coli S30 system,
has the further advantage that a variety of drugs which block
prokaryotic translation (e.g., inhibitors of ribosome
function), such as rifampicin or chloramphenicol, can be added
at a suitable concentration and/or timepoint to stall
translation and produce a population of stalled polysomes,
suitable for affinity screening against a predetermined
receptor or epitope (e.g., a G protein-linked receptor
protein).

In general, the improved method comprises the steps
of: (1) introducing a population of mRNA species into a
prokaryotic in vitro translation system (e.g., E. coli S30)
under conditions suitable for translation to form a pool of
polysomes displaying nascent peptides or nascent single-chain
antibodies (e.g., stalled polysomes), so-called
polysome-forming conditions; (2) contacting the polysomes with a
predetermined receptor or epitope under suitable binding
conditions (i.e., for specific binding to the receptor/epitope
and for preserving intact polysome structure); (3) selecting
polysomes which are specifically bound to the receptor or
epitope (e.g., by removing unbound polysomes by washing with a

said cDNA or amplification product). Often, the receptor or
epitope used for screening is immobilized, such as by being
bound to a solid support. 

In a variation of the improved method, the population
of mRNA molecules is introduced into the in vitro translation
system by de novo synthesis of the mRNA from a DNA template.
In this improvement, a population of DNA templates capable of
being transcribed in vitro (e.g., having an operably linked T7
or SP6 or other suitable promoter) is introduced into a
coupled in vitro transcription/translation system (e.g., an E.
coli S30 system) under conditions suitable for in vitro
transcription and translation.
Generally, using a coupled in vitro transcription/translation
system is highly efficient for producing polysomes displaying
nascent peptides and single-chain antibodies suitable for
affinity screening. Of course, and as noted above, uncoupled
systems may also be used, i.e., by adding mRNA to an in vitro
translation extract.

A further improvement to the general methods of
screening nascent peptide-displaying polysomes and single-chain
antibody-displaying polysomes comprises the additional step of
adding a preblocking agent (e.g., nonfat milk, serum albumin,
tRNA, and/or gelatin) prior to or concomitant with the step of
contacting the nascent peptide-displaying polysomes with an
immobilized receptor or the nascent single-chain
antibody-displaying polysomes with an immobilized epitope. The
additional step of adding a preblocking agent reduces the
amount of polysomes which bind nonspecifically to the receptor
or epitope and/or to the immobilization surface (e.g.,
microtitre well), thereby enhancing the specificity of
selection for polysomes displaying peptides that specifically
bind to the receptor(s) or antibodies which specifically bind
the predetermined epitope(s). Although the preblocking agent
can be selected from a broad group of suitable compositions, 



preferable. Other suitable preblocking agents can be used.
Preblocking agents that do not substantially interfere with
specific binding (i.e., non-interfering) are suitable. 

A further improvement to the general methods of 
screening nascent peptide-displaying polysomes comprises the 
additional step of isolating polysomes from an in vitro 
translation reaction (or a coupled in vitro 
transcription/translation reaction) prior to the step of 
contacting the nascent peptide-displaying polysomes with 
immobilized receptor. Generally, the polysomes are isolated 
from a translation reaction by high speed centrifugation to 
pellet the polysomes, so that the polysome pellet is recovered 
and the supernatant containing contaminants is discarded. The 
polysome pellet is resolubilized in a suitable solution to 
retain intact polysomes. The resolubilized polysomes may be 
recentrifuged at lower speed (i.e., which does not pellet 
polysomes) so that the insoluble contaminants pellet and are 
discarded and the supernatant containing soluble polysomes is 
recovered, and the supernatant is used for affinity screening.
Alternatively, the resolubilized polysomes may be used for
affinity screening directly (i.e., without low speed
centrifugation). Furthermore, the order of centrifugation may
be reversed, so that low speed centrifugation is performed 
prior to high speed centrifugation; the low speed 
centrifugation supernatant is then centrifuged at high speed 
and the pelleted polysomes are resolubilized and used for 
affinity screening. Multiple rounds of high speed and/or low 
speed centrifugation may be used to increasingly purify the 
polysomes prior to contacting the polysomes with the 
immobilized selection receptor(s) or epitope(s).

Another improvement to the general methods of
affinity screening of nascent peptide-displaying polysomes
comprises adding a non-ionic detergent to the binding and/or
wash buffers. Non-ionic detergent (e.g., Triton X-100, NP-40)
is added to the binding buffer (i.e., the aqueous solution in
which the polysomes are contacted with the immobilized
receptor) and/or the wash buffer (i.e., the aqueous solution
used to wash the polysomes bound to the immobilized receptor).
Generally, the non-ionic detergent is added to a final
concentration of between about 0.01% and 0.5% (v/v), with 0.1%
being typical.
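
As a worked illustration of the stated concentration range, the
volume of a concentrated detergent stock needed to reach a given
final concentration follows from C(stock) x V(stock) = C(final) x
V(final). The sketch below assumes a 10% (v/v) stock and a 1000
microlitre final buffer volume; both values, and the function name
detergent_stock_volume, are hypothetical and chosen only for the
arithmetic.

```python
def detergent_stock_volume(final_percent: float, final_volume_ul: float,
                           stock_percent: float = 10.0) -> float:
    """Return microlitres of detergent stock needed for the desired final % (v/v).

    Uses C_stock * V_stock = C_final * V_final.
    """
    if not 0 < final_percent <= stock_percent:
        raise ValueError("final concentration must be positive and not exceed the stock")
    return final_percent * final_volume_ul / stock_percent


# Lower bound, typical value, and upper bound of the range given in the text.
for percent in (0.01, 0.1, 0.5):
    volume = detergent_stock_volume(percent, final_volume_ul=1000.0)
    print(f"{percent}% (v/v): add {volume:.1f} uL of 10% stock per 1000 uL buffer")
```

Under these assumptions the typical 0.1% (v/v) condition
corresponds to 10 microlitres of stock per millilitre of buffer.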

Another improvement to the general methods of
affinity screening of nascent peptide libraries is generating
the DNA template library (from which the mRNA population is
transcribed) in vitro without cloning the library in host
cells. Cloning libraries in host cells frequently diminishes
the diversity of the library and may skew the distribution of
the relative abundance of library members. In vitro library
construction generally comprises ligating each member of a
population of polynucleotides encoding library members to a
polynucleotide sequence comprising a promoter suitable for in
vitro transcription (e.g., T7 promoter and leader). The
resultant population of DNA templates may optionally be
purified by gel electrophoresis. The population of DNA
templates is then transcribed and translated in vitro, such as
by a coupled transcription/translation system (e.g., E. coli
S30).

A further improvement to the general methods of
affinity screening comprises the added step of combining
affinity screening of a nascent peptide-displaying polysome
library with screening of a bacteriophage peptide display
library (or other, i.e., peptides on plasmids, expression as
secreted soluble antibody in host cells, in vitro expression).
In this improvement, polysomes are isolated by affinity
screening of a nascent peptide-display library. The isolated
polysomes are dissociated, and cDNA is made from the mRNA
sequences that encoded nascent peptides that specifically bound
to the receptor(s). The cDNA sequences encoding the nascent
peptide binding regions (i.e., the portions which formed
binding contacts to the receptor(s); variable segment
sequences) are cloned into a suitable bacteriophage peptide
display vector (e.g., pAFF6 or other suitable vector). The
phage clones express on their virion surface the
polysome-derived peptide sequences as fusions to a coat protein
(e.g., as an N-terminal fusion to the pIII coat protein). By
incorporating the in vitro-enriched peptide sequences from the
polysome screening into a bacteriophage display system, it is
possible to continue affinity selection for additional rounds.
It is also advantageous, because the resultant bacteriophage
display libraries can be screened and tested under conditions
that might not have been appropriate for the intact polysomes.
For example, although the monovalent display that can be
achieved with the polysome system has advantages in isolating
high affinity ligands (depending on conditions, a multivalent
ligand composed of several copies of a low affinity ligand can
have a very high affinity), there may be other circumstances
where multivalent display (which can be achieved with the phage
system) is desirable for binding to the receptor(s) under
binding conditions that may be incompatible with intact
polysomes. The same combined polysome/bacteriophage screening
sequence can be used for single-chain antibodies. In one
aspect of the invention a bacterial host cell is transformed or
infected with a bacteriophage expression vector, which vector
comprises a DNA library member which encodes a fusion protein
composed of a V H and a V L in peptide linkage to the
amino-terminus of a filamentous bacteriophage coat protein
sequence, typically a pIII or pVIII sequence.

Another improvement to the methods of affinity
screening is the control of display valency (i.e., the average
number of functional scFv displayed per polysome or per phage
particle), and the capacity to vary display valency in
different rounds of affinity screening. Typically, a high
display valency permits many binding contacts between the
polysome (or phage particle) and epitope, thus affording stable
binding for polysomes (or phage particles) which encode scFv
species which have relatively weak binding. Hence, a high
display valency system allows screening to identify a broader
diversity range of scFv species, since even lower affinity scFv
can be recovered and subsequently improved to higher affinity
scFv by selecting high affinity scFv from a pool of
mutagenized low-to-medium affinity scFv clones. Thus, affinity
sharpening by mutagenesis and subsequent rounds of affinity



selection can be used in conjunction with a broader pool of 
initially selected scFv sequences if a high display valency 
method is used. Alternate rounds of high display valency 
screening and low display valency screening can be performed, 
in any order, starting from either a high or low valency 
system, for as many affinity screening rounds as desired, with 
intervening mutagenesis (directed, random, pseudorandom, CDR- 
clustered, etc.) and scFv sequence diversity broadening, if 
desired. Alternate rounds of affinity screening, wherein a 
first round consists of screening a scFv library expressed in a 
high valency display system, selecting scFv clones which bind 
the predetermined epitope, optionally conducting a mutagenesis 
step to expand the sequence kernel of the selected scFv
sequence(s), expressing the selected scFv clones in a lower 
valency display system, and selecting scFv clones which bind 
the predetermined epitope, can be performed, including various 
permutations and combinations of multiple screening cycles, 
wherein each cycle can be of a similar or different display 
valency. This improvement affords an overall screening program 
that employs systems which are compatible with switchable 
valency (i.e., one screening cycle can have a different display 
valency than the other(s), and can alternate in order).

Display valency can be controlled by a variety of 
methods, including but not limited to: controlling the average 
number of nascent peptides per polysome in a polysome-display 
system, and controlling the average number of coat protein 
molecules which comprise a displayed scFv sequence per phage 
particle. The former can be controlled by any suitable method, 
including: (1) altering the length of the encoding mRNA
sequence to reduce or increase the frequency of translation
termination (a longer mRNA will typically display more nascent
peptides per polysome than a shorter mRNA encoding sequence),
(2) incorporating stalling (i.e., infrequently used) codons in
the encoding mRNA, (3) incorporating RNA secondary
structure-forming sequences (e.g., hairpin, cruciform, etc.)
distal to the scFv-encoding portion and proximal to (upstream,
5' to) the translation termination site, if any, and/or (4)
including an antisense polynucleotide (e.g., DNA, RNA,
polyamide nucleic acid) that hybridizes to the mRNA distal to
the scFv-encoding portion and proximal to (and possibly
spanning) the translation termination site, if any. The length
of the mRNA may be increased to increase display valency, such
as by adding additional reading frame sequences downstream of
the scFv-encoding sequence(s); such additional reading frame
sequences can, for example, encode the sequence (-AAVP-)n,
where n is typically at least 1, frequently at least 5 to 10,
often at least 15 to 25, and may be at least 50-100, up to

longer stall sequence can be used. Stalling codons (i.e., 
codons which are slowly translated relative to other codons in
a given translation system) can be determined empirically for
any translation system, such as by measuring translation
efficiency of mRNA templates which differ only in the presence
or relative abundance of particular codons. For example, a set
of scFv clones can be evaluated in the chosen translation
system; each scFv species of the set has a stalling polypeptide
sequence of 25 amino acids, but each stalling polypeptide
sequence consists of a repeating series of one codon, such that
all translatable codons are represented in the set. When
translated under equivalent conditions, the scFv species which
produce polysomes having the highest valency (e.g., as
determined by sedimentation rate, buoyancy, electron
microscopic examination, and other diagnostic methods) thereby
identify stalling codons as the codon(s) in the stalling
polypeptide sequence.
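
The design of such a test set can be sketched as follows, for
illustration only. The sketch assumes the standard genetic code
(61 sense codons, three stop codons) and 25 repeats of a single
codon per stalling tract; the function names stalling_test_set and
rank_by_valency, and the valency measurements shown, are
hypothetical and are not experimental data.

```python
from itertools import product

STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code assumed


def stalling_test_set(repeats: int = 25) -> dict[str, str]:
    """Map each sense codon to a tract made of that codon repeated `repeats` times."""
    codons = ("".join(bases) for bases in product("TCAG", repeat=3))
    return {codon: codon * repeats for codon in codons if codon not in STOP_CODONS}


def rank_by_valency(measured_valency: dict[str, float]) -> list[str]:
    """Order codons from highest to lowest measured display valency."""
    return sorted(measured_valency, key=measured_valency.get, reverse=True)


test_set = stalling_test_set()
print(len(test_set), len(test_set["CGG"]))  # 61 75

# Hypothetical measurements (arbitrary units), for illustration only.
example = {"CGG": 4.2, "GCT": 1.1, "AGG": 3.7}
print(rank_by_valency(example))  # ['CGG', 'AGG', 'GCT']
```

Each 75-nucleotide tract would be appended in frame to an
otherwise identical scFv template, so that differences in observed
valency can be attributed to the repeated codon.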

In one embodiment, a stalling polypeptide sequence is
distal (3' to) the scFv-encoding sequence, and comprises
-(Gly-Gly-Gly-Gly-Ser)4-A-A-V-P-, or repeats thereof.
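
For illustration only, the stalling polypeptide of this embodiment
can be written out in one-letter code as follows. The sketch
assumes that "repeats thereof" refers to tandem copies of the whole
(Gly-Gly-Gly-Gly-Ser)4-A-A-V-P unit; the function name
stall_peptide and the copies parameter are hypothetical.

```python
LINKER_UNIT = "GGGGS"   # Gly-Gly-Gly-Gly-Ser
TAIL = "AAVP"           # Ala-Ala-Val-Pro


def stall_peptide(copies: int = 1) -> str:
    """Return (Gly4Ser)4-AAVP in one-letter code, repeated `copies` times."""
    if copies < 1:
        raise ValueError("copies must be at least 1")
    return (LINKER_UNIT * 4 + TAIL) * copies


single = stall_peptide()
print(single, len(single))        # GGGGSGGGGSGGGGSGGGGSAAVP 24
print(len(stall_peptide(3)))      # 72
```

A single copy is 24 residues long, close to the 25-residue tracts
used when testing individual stalling codons.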

Alternatively, or in combination with the noted
variations, the valency of the target epitope may be varied to
control the average binding affinity of selected scFv library
members. The target epitope can be bound to a surface or
substrate at varying densities, such as by including a
competitor epitope, by dilution, or by other method known to
those in the art. A high density (valency) of predetermined
epitope can be used to enrich for scFv library members which
have relatively low affinity, whereas a low density (valency)
can preferentially enrich for higher affinity scFv library
members.

Each of the improvements to the methods of affinity 
screening may be combined with other compatible improvements. 
For example, an in vitro transcription/translation system can
be used in conjunction with a library of DNA templates
synthesized in vitro (i.e., without cloning in a host cell).
The resultant polysomes can be purified by one or more rounds
of high-speed and/or low-speed centrifugation. The purified
polysomes can be contacted with an immobilized receptor that is 
preblocked (e.g., with nonfat milk), and a non-ionic detergent 
may also be present to further reduce nonspecific binding. The 
selected polysomes may then be used as templates for 
synthesizing cDNA which is then cloned into a bacteriophage 
display vector, such that the variable segments of the nascent 
peptides are now displayed on bacteriophage. The improved 
methods can also be used in conjunction with the tethered 
nascent peptide methods (infra).

Methods for Tethered Nascent Peptide Polysomes

In one aspect, the present invention relates to an
improved method for using in vitro translation to produce
peptide and single-chain antibody libraries; the improvement
relates to the elimination of the polysome from the screening
(receptor binding step or epitope binding step).

A basis of the present invention is the physical
linkage of a nascent peptide to a polynucleotide sequence
complementary to or corresponding to the mRNA that served as
the template for the nascent peptide's or single-chain
antibody's synthesis. In this improved aspect of the
invention, this physical linkage is accomplished without
requiring stable polysomes. The methods of the
present invention avoid the need to isolate polysomes for
screening nascent peptides and nascent antibodies, and thereby
provide several advantages, such as affording the use of
structural complexes which are more stable than polysomes and
removing the ribosomes as a source of steric hindrance and
non-specific binding during subsequent screening steps.
5 The peptide library is generated by in vitro 

synthesis in a cell-free system, wherein individual library 
members comprise a nascent polypeptide. The nascent 
polypeptide (including single-chain antibody) is synthesized as 
a fusion protein comprising (l) a first polypeptide portion, 

10 termed the "tether segment", comprising a polypeptide sequence 
that binds to the encoding mRNA molecule serving as the 
translation template for the synthesis of the nascent 
polypeptide, or to a bound DNA primer or cDNA copy of such 
encoding mRNA, either directly or through binding an
intermediate molecule that is linked directly to the encoding
mRNA, DNA primer, or cDNA copy thereof, and (2) a second 
polypeptide portion, termed the "variable segment" or single- 
chain antibody portion, comprising one of a variety of possible 
amino acid sequence combinations represented in the library.
The variable segment may be of various lengths as well as
sequences, and typically peptide variable segments comprise 
from 2 to about 50, typically about 5 to 20, amino acid 
residues, although they may range up to 50-500 residues or
more for polypeptide variable segments. (See U.S. Patent
5,223,409, incorporated herein by reference.) The translation
conditions selected are suitable for permitting the tether 
segment of the nascent polypeptide to bind to its encoding 
polynucleotide before significant dissociation and diffusion of 
the nascent peptide from the translation complex occurs, and
also to reduce binding between translation complexes. It may
be desirable to stall or slow the elongation cycle of ribosomal 
translocation to increase the probability of forming the 
linkage between the tether segment and the polynucleotide
[...] translocation, including but not limited to, engineering
secondary structure into the mRNA species to stall translation 
at a predetermined site carboxy-terminal to the tether segment 
(and preferably carboxy-terminal to the variable segment),
annealing a polynucleotide (e.g., DNA) primer to the 3' portion
of the mRNA to inhibit complete translation, using rare codons
at the 3' end of the coding sequence (and/or altering ratios of
selected amino-acyl tRNA species in the translation reaction),
and including a low concentration of translational
inhibitor(s), including a translation stall sequence (e.g., one that may
be selected from a library by selecting for stalled polysomes),
among others.
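
As a rough illustration of why the variable segment lengths given above matter for library design, the theoretical number of distinct sequences for a fully random segment of n residues drawn from the 20 standard amino acids is 20^n. The Python sketch below tabulates that number for the lengths mentioned in the text; the function name is illustrative only and is not part of the disclosure, and a real library will sample only a fraction of this theoretical space.

```python
def theoretical_diversity(length: int, alphabet_size: int = 20) -> int:
    """Number of distinct sequences of `length` residues over the alphabet.

    Assumes every position is fully randomized over `alphabet_size`
    residues (20 standard amino acids by default).
    """
    if length < 0:
        raise ValueError("length must be non-negative")
    return alphabet_size ** length


if __name__ == "__main__":
    # Lengths taken from the ranges quoted in the text (2, 5, and 20 residues).
    for n in (2, 5, 20):
        print(f"{n:>2} residues: {theoretical_diversity(n):.3e} possible sequences")
```

Because the count grows exponentially with length, even a very large library covers only a small part of the sequence space for segments near the upper end of the typical range.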

Various strategies may be used to link the tether
segment to the polynucleotide containing the encoding
information of the nascent peptide or single-chain antibody.

Various strategies may be used to link the tether 
segment to the polynucleotide containing the encoding 
information of the nascent peptide. In a basic method of the 
invention, a population of messenger RNA molecules which 
individually encode a fusion protein comprising a common tether 
segment sequence and one of a variety of variable segment 
sequences represented in the random, pseudorandom, or defined 
sequence framework peptide sequence library is generated. The 
mRNA population can be generated by any of various methods 
known in the art, but in vitro transcription of synthetic DNA 
templates is a convenient method. For example, a plasmid 
containing a promoter (e.g., a T7 promoter) capable of driving
in vitro transcription of an operably linked polynucleotide
sequence encoding a tether segment and possessing a restriction
site for insertion of a variable segment sequence(s) or single-
chain antibody encoding cassette may be prepared in large
scale. The plasmid can be digested with the appropriate
restriction enzyme to open the site for insertion of the
variable segment sequence(s) or single-chain antibody
cassette(s). For generating diverse variable segments, a
collection of synthetic oligonucleotides encoding random, [...]
single-chain antibody cassette(s) can be expanded by mutating
the CDR(s) with site-directed mutagenesis, CDR-replacement, and
the like. The resultant DNA molecules can be propagated in a
host for cloning and amplification or can be used directly 
(i.e., may avoid loss of diversity which may occur upon 
propagation in a host cell); in either case, purified DNA is
transcribed in vitro with the appropriate RNA polymerase (e.g., 
T7 polymerase) to form the population of mRNA molecules 
encoding the nascent peptide library or nascent single-chain 
antibody library. 

The population of mRNA molecules is translated,
typically in an in vitro translation system, such as a 

suitable in vitro transcription system. The cell-free
continuous-flow (CFCF) translation system of Spirin et al.
(1988) Science 242:1162 may be used to increase total yield of
library members, or for convenience of use, if desired. A 
static in vitro protein synthesis system can be used. In this 
system, protein synthesis generally ceases after 1 h and thus 
limits the time interval for creation of the library. The 
advantage of CFCF technology is that high level and long-term 
synthesis of protein should result in a much larger and more 
diverse library of protein-RNA complexes. The CFCF technology 
has been described by Spirin and co-workers as a method for the 
high-level synthesis of protein over an extended period of 
time, 24 h or longer. In addition, CFCF technology results in 
fractionation of the newly-synthesized protein from the 
translational apparatus, and thus makes it feasible to quickly 
sequester the protein-nucleic acid complexes from polysome- 
associated nucleases and proteases. Other applications of CFCF 
technology include an efficient method for synthesizing 
peptides. For example, following the identification of a 
peptide-fusion which binds to a target with high-affinity, the 
free peptide can be synthesized directly using CFCF technology 
and used in a binding assay. 

[...] interest or single-chain antibody of interest, are selected
from the library by an affinity enrichment technique. This is
accomplished by means of an immobilized macromolecule or epitope
specific for the peptide sequence of interest, such as a
receptor, other macromolecule, or other epitope species. 
Repeating the affinity selection procedure provides an 
enrichment of library members encoding the desired sequences, 
which may then be isolated for sequencing, further propagation 
and affinity enrichment. 

The library members without the desired specificity 
are removed by washing. The degree and stringency of washing 
required will be determined for each peptide sequence or 
single-chain antibody of interest and the immobilized 
predetermined macromolecule or epitope. A certain degree of 
control can be exerted over the binding characteristics of the 
nascent peptide/DNA complexes recovered by adjusting the 
conditions of the binding incubation and the subsequent 
washing. The temperature, pH, ionic strength, divalent cation
concentration, and the volume and duration of the washing will
select for nascent peptide/DNA complexes within particular 
ranges of affinity for the immobilized macromolecule. 
Selection based on slow dissociation rate, which is usually 
predictive of high affinity, is often the most practical route. 
This may be done either by continued incubation in the presence 
of a saturating amount of free predetermined macromolecule, or 
by increasing the volume, number, and length of the washes. In 
each case, the rebinding of dissociated nascent peptide/DNA or 
peptide/RNA complex is prevented, and with increasing time, 
nascent peptide/DNA or peptide/RNA complexes of higher and 
higher affinity are recovered. 
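
The off-rate selection described above can be illustrated with a simple first-order dissociation model, in which the fraction of complexes still bound after a wash of duration t is exp(-k_off * t), provided rebinding is prevented. The Python sketch below is a minimal illustration under that assumption; the rate constants and wash time used are hypothetical values chosen for the example, not values from this disclosure.

```python
import math


def fraction_remaining(k_off_per_s: float, wash_seconds: float) -> float:
    """Fraction of complexes still bound after washing.

    Assumes first-order dissociation and no rebinding of dissociated
    complexes (e.g., because of excess free competitor or large wash volume).
    """
    return math.exp(-k_off_per_s * wash_seconds)


def enrichment_ratio(k_off_slow: float, k_off_fast: float, wash_seconds: float) -> float:
    """Relative enrichment of the slow-dissociating species over the fast one."""
    return fraction_remaining(k_off_slow, wash_seconds) / fraction_remaining(
        k_off_fast, wash_seconds
    )


if __name__ == "__main__":
    # Hypothetical off-rates (per second) and a hypothetical 10-minute wash.
    slow, fast, wash = 1e-4, 1e-2, 600.0
    print(f"slow binder retained: {fraction_remaining(slow, wash):.3f}")
    print(f"fast binder retained: {fraction_remaining(fast, wash):.5f}")
    print(f"enrichment of slow over fast: {enrichment_ratio(slow, fast, wash):.0f}-fold")
```

Under this model, lengthening the wash increases the enrichment exponentially, at the cost of recovering fewer complexes overall.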

Additional modifications of the binding and washing 
procedures may be applied to find peptides with special 
characteristics. The affinities of some peptides are dependent 
on ionic strength or cation concentration. This is a useful 
characteristic for peptides that will be used in affinity 
purification of various proteins when gentle conditions for 
[...] such that a polysome scFv library can be simultaneously
screened for a multiplicity of scFv which have different
binding specificities. Given that the size of a scFv library
often limits the diversity of potential scFv sequences, it is
typically desirable to use scFv libraries of as large a size as
possible. The time and economic considerations of generating a
number of very large polysome scFv-display libraries can become
prohibitive. To avoid this substantial problem, multiple
predetermined epitope species (receptor species) can be 
concomitantly screened in a single library, or sequential 
screening against a number of epitope species can be used. In
one variation, multiple target epitope species, each encoded on
a separate bead (or subset of beads), can be mixed and
incubated with a polysome-display scFv library under suitable 
binding conditions. The collection of beads, comprising 
multiple epitope species, can then be used to isolate, by
affinity selection, scFv library members. Generally,
subsequent affinity screening rounds can include the same 
mixture of beads, subsets thereof, or beads containing only one 
or two individual epitope species. This approach affords 
efficient screening, and is compatible with laboratory
automation, batch processing, and high throughput screening
methods.

A variety of techniques can be used in the present 
invention to diversify a peptide library or single-chain 
antibody library, or to diversify around variable segment
peptides or VH, VL, or CDRs found in early rounds of panning to
have sufficient binding activity to the predetermined 
macromolecule or epitope. In one approach, the positive 
nascent peptide/polynucleotide complexes (those identified in 
an early round of affinity enrichment) are sequenced to
determine the identity of the active peptides.
Oligonucleotides are then synthesized based on these active 
peptide sequences, employing a low level of all bases 
incorporated at each step to produce slight variations of the
[...] segment sequences at the appropriate locations. This method
produces systematic, controlled variations of the starting 
peptide sequences. It requires, however, that individual 
positive nascent peptide/polynucleotide complexes be sequenced
before mutagenesis, and thus is useful for expanding the
diversity of small numbers of recovered complexes and selecting
variants having higher binding affinity and/or higher binding
specificity. In a variation, mutagenic PCR amplification of 
positive nascent peptide/polynucleotide complexes (especially 
of the variable region sequences, the amplification products of 
which may be ligated to tether sequences and operably linked to 
an in vitro promoter) is performed and one or more additional 
rounds of screening are done prior to sequencing. The same
general approach can be employed with single-chain antibodies
in order to expand the diversity and enhance the binding 
affinity/specificity, typically by diversifying CDRs or 
adjacent framework regions. 
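
The doped-oligonucleotide approach described above (a low level of all bases incorporated at each synthesis step) can be sketched as a simple simulation. In the Python example below, each position of a parent sequence is, with a small probability, drawn from an equal mixture of the four bases and otherwise keeps the parent base. The parent sequence, the doping level, and the function name are hypothetical and serve only to illustrate the idea; actual doping chemistry and base ratios would be chosen by the practitioner.

```python
import random

BASES = "ACGT"


def doped_oligo(parent: str, dope: float, rng: random.Random) -> str:
    """Return one variant of `parent` made with a doped base mixture.

    At each position, with probability `dope` the base is drawn uniformly
    from all four bases (so it may by chance equal the parent base);
    otherwise the parent base is kept.
    """
    out = []
    for base in parent:
        if rng.random() < dope:
            out.append(rng.choice(BASES))
        else:
            out.append(base)
    return "".join(out)


if __name__ == "__main__":
    rng = random.Random(0)          # fixed seed so runs are repeatable
    parent = "GCTTGGCATCGTAAA"      # hypothetical 15-nt parent sequence
    for _ in range(5):
        print(doped_oligo(parent, dope=0.1, rng=rng))
```

A low doping level yields variants that mostly differ from the parent at one or a few positions, which matches the stated aim of producing slight, controlled variations around an active sequence.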

In a method of the invention, a peptide library is 
generated by in vitro synthesis in a cell-free system, wherein 
individual library members comprise a nascent polypeptide 
comprising a first polypeptide portion linked to a 
polynucleotide encoding said nascent polypeptide (or a 
polynucleotide complementary to the encoding polynucleotide 
sequence) and a second polypeptide portion having a variable 
amino acid sequence, at least in part, in peptide linkage to 
said first polypeptide portion. The nascent polypeptide is 
synthesized as a fusion protein comprising (1) a first 
polypeptide portion, termed the "tether segment", comprising a 
polypeptide sequence which binds to the encoding mRNA molecule 
serving as the translation template for the synthesis of the 
nascent polypeptide, or to a cDNA copy of such encoding mRNA, 
either directly or through binding an intermediate molecule 
that is linked directly to the encoding mRNA or cDNA copy 
thereof, and (2) a second polypeptide portion, termed the 
"variable segment", comprising one of a variety of possible 
amino acid sequence combinations represented in the library. 
The tether segment [...] the variable segment [...]
individual library peptide's variable segment. The linked 
polynucleotide of a library member provides the basis for 
replication of the library member after a screening or
selection procedure, and also provides the basis for the
determination, by nucleotide sequencing, of the identity of the
variable segment amino acid sequence.

Tether-Binding Antibody Linked to Polynucleotide
An antibody known to bind with high affinity to a
particular peptide sequence is attached to the 5' or 3' end of
the RNA molecules or to the 5' end of a DNA primer annealed to
the mRNA molecules. This can be done through an avidin-biotin
bridge or via homo- or heterobifunctional cross-linkers of
amine, carboxyl, or thiol on the antibody to amine or thiol on
the RNA or DNA primer. The sequence encoding the peptide
epitope for this antibody is encoded by all the RNA molecules
in the library as the tether segment; this tether segment
sequence is placed either to the 5' or the 3' side of the
variable segment sequence. During in vitro translation, the
nascent epitope (tether segment) can bind to the attached
antibody with an affinity high enough to allow dissociation of
the polysome (by EDTA treatment, for example) and isolation of
the intact mRNA-antibody-epitope-variable peptide complex ready
for screening.

A modification of this strategy comprises the 
attachment by any of the means described above of the antibody
to a segment of DNA that can hybridize to the 3' end of the
RNA. The attractive features of this scheme are: first, the 
hybrid may serve to block the dissociation of the ribosome, 
allowing more time for the more stable complex to form; second, 
the DNA segment is a generic reagent that can be prepared in
large amount (with or without the attached antibody)
independent of the construction of a particular library; third, 
the DNA can be extended after translation to provide a more 
stable form of the sequence information (DNA is generally less
[...]

(1) A complementary DNA fragment of [...] several
hundred bases is hybridized to the RNA library (e.g., a primer
comprising a specific sequence complementary to the known 3'
end of the mRNA species or oligo(dT) if the RNA comprises a
poly(A) tail). In some embodiments, the RNA may comprise a
polyadenylated tail, which may stabilize the RNA template in 
some in vitro translation reactions (e.g., reticulocyte 
lysate). The complementary DNA primer may be attached to the
antibody prior to hybridization, or it may simply be modified 
so as to bind the antibody after translation of the RNA has 
been performed. In either case, the mode of attachment may be 
one of those proposed above. By way of example, a 5'-biotin is 
attached to the DNA; an excess of streptavidin is added to 
occupy all the biotins, and the unbound streptavidin is 
removed; a biotinylated antibody is added to bind to the DNA- 
streptavidin complexes, and the excess antibody is removed. 
Note that a monovalent form of the complex can be formed using 
Fab', a bivalent complex can be formed with IgG, or a
multivalent complex formed by adding a string of biotins to the 
DNA to bind several streptavidin molecules and consequently 
several antibodies; 

(2) The epitope sequence (tether segment) encoded by 
the mRNA is expressed and binds to the antibody. A variable 
segment peptide can be displayed with a free C-terminus if 
fused to the C-terminus of the attachment epitope (tether 
segment) or with a free N-terminus if fused to the N terminus 
of the tether segment (note that in the latter case, the N- 
terminal F-met is preferably removed by aminopeptidase or by 
treatment with a specific protease);

(3) The ribosome is dissociated with EDTA; 

(4) The DNA primer is extended with reverse 
transcription (AMV reverse transcriptase under standard 
conditions) of the RNA template; 

(5) At this point the RNA may be removed with RNAse 
treatment, but this is not necessary. The library member 
consists of the displayed peptide-antibody-cDNA (and hybridized
[...]
in the in vitro translation reaction and all steps prior to the
synthesis of the first-strand cDNA.

Biotinylated Tether Segment
A variation on the theme that avoids the use of an 
antibody substitutes a "biotinylation substrate" (BS) for the 
epitope as the tether segment, and streptavidin for the 
antibody. The biotinylation substrate is a sequence that is 
recognized by a prokaryotic enzyme, biotin holoenzyme 
synthetase (BirA) which attaches a biotin to a lysine in the 
recognition sequence. Inclusion of the enzyme in the 
translation mix, or treatment of the polysomes with the enzyme 
following translation biotinylates the nascent peptide, which 
can then bind to a streptavidin molecule attached to either end 
of the mRNA or to the small DNA primer hybridized to the RNA. 
Streptavidin may be attached to the mRNA or DNA primer by 
direct covalent linkage or via biotin moieties incorporated 
into the polynucleotides or covalently attached to the 5' end 
of the polynucleotide; however, the biotinylation of the mRNA 
(or DNA primer) preferably does not adversely affect 
translational efficiency of the mRNA template for translation 
of the tether segment or variable segment.

Streptavidin is a bacterial protein which binds the 
water soluble vitamin, biotin, with high affinity. It is 
possible to attach biotin to RNA (or DNA), and thus convert
streptavidin to an RNA-binding protein through a biotin 
linkage. It is also possible to fuse heterologous proteins to 
the C-terminus of streptavidin without affecting functional 
binding to biotin. Thus, for the purpose of peptide libraries, 
it will be possible to fuse a variable segment to a tether 
segment comprising the C-terminus of streptavidin. Biotin can 
be attached to the 5' end of mRNA using chemical modification, 
or incorporated into the mRNA by in vitro transcription using
biotinylated nucleotide analogs. 

RNA-Binding Protein Sequence as Tether Segment 

[...] segment peptide sequence can provide a linkage between the
peptide and the encoding polynucleotide or at least increase
the residence time of the peptide on the mRNA and thereby
improve the efficiency with which the high affinity epitope-
antibody or biotin-streptavidin complex can form. Tat or small
peptide derivatives of Tat can be used to produce nascent 
peptide-polynucleotide complexes.

The human immunodeficiency virus (HIV) protein Tat is 
a strong activator of viral gene transcription. The Tat 
protein stimulates transcription by binding to a specific RNA 
sequence (Tar) located at the 5' end of the Tat mRNA. There 
are several features of the Tat/Tar complex that are useful for 
the method described. For example, Tat binds Tar with
relatively high affinity. The dissociation constant (Kd) for 
the Tat/Tar complex is 5 nM, but the inclusion of a non-ionic 
detergent reduces the Kd to approximately 100 pM. Peptides 
that bind to Tar with higher affinity may be selected by 
panning phage display peptide libraries against immobilized RNA 
comprising a Tar sequence. Further, the minimum sizes of Tat
protein and Tar RNA required for binding are small and defined.
Tat is an 86-residue protein, but only the last 24 residues of
the carboxy terminus are required for high-affinity binding. 
The Tar stem-loop structure includes only 57 nucleotides but 
can be shortened to 27 nucleotides without affecting binding. 
Also, the conformation of the Tar RNA has been solved by NMR
spectroscopy. Moreover, Tat binds to Tar as a fusion protein. 
There are at least two examples of functional fusions to the 
carboxy-terminus of Tat. The first is a fusion to the viral 
Rev protein, and the second is a fusion to the coat protein of 
bacteriophage MS2. Thus, a random peptide library based on
peptides fused to the C-terminus of Tat will function properly, 
as such fusions do not significantly adversely affect RNA 
binding. Finally, Tat binds to Tar as a monomer. This feature 
may prove useful in controlling the valency of peptide display 
by varying the number of Tar binding sites. Thus, libraries 
that are either monovalent or multivalent can be generated. 
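
To give a feel for what the two dissociation constants quoted above (5 nM, and approximately 100 pM with non-ionic detergent) mean in practice, the Python sketch below evaluates a simple 1:1 binding isotherm, fraction bound = [L] / ([L] + Kd). This is a minimal illustration assuming monomeric binding at equilibrium with the free tether concentration [L] held constant; the 1 nM concentration used is a hypothetical value, and the function name is illustrative.

```python
def fraction_bound(free_ligand_molar: float, kd_molar: float) -> float:
    """Equilibrium fraction of target sites occupied for 1:1 binding.

    Assumes a single class of independent sites and that the free ligand
    concentration is not significantly depleted by binding.
    """
    if free_ligand_molar < 0 or kd_molar <= 0:
        raise ValueError("concentration must be >= 0 and Kd must be > 0")
    return free_ligand_molar / (free_ligand_molar + kd_molar)


if __name__ == "__main__":
    free_tether = 1e-9  # hypothetical 1 nM free Tat tether
    for label, kd in (("Kd = 5 nM", 5e-9), ("Kd = 100 pM", 100e-12)):
        print(f"{label}: {fraction_bound(free_tether, kd):.2f} of Tar sites occupied")
```

At the same tether concentration, the lower Kd corresponds to a substantially higher occupancy of Tar sites, which is why conditions that tighten binding are attractive for forming stable peptide-RNA complexes.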

[...] sequence (Weeks KM and Crothers DM [...],
incorporated herein by reference), can be included in the mRNA
sequence, preferably near the translation start site, to allow
attachment of a Tat tether segment of the nascent peptide to
the mRNA. The RNA-binding tether (e.g., Tat segment) will 
inhibit further translational starts on the RNA template. 

Another suitable RNA-binding protein for use as a 
tether in fusions is the iron response element binding protein 
(IREBP), which interacts specifically with the iron response
element (IRE) located at the 5' end of the ferritin mRNA and in the 3'
untranslated region of the transferrin receptor mRNA. The
protein binds as a monomer with a dissociation constant of 20-
50 pM (Swenson et al. (1991) Biol. Met. 4:48). The IREBP (98.4
kDal) is active in binding to the IRE after being translated in
vitro (Hirling et al. (1992) Nucleic Acids Res. 20:33). Thus,
an RNA-binding tether can comprise an IREBP and the mRNA can
comprise an IRE; the RNA-binding IREBP tether segment binds to
an IRE sequence in each mRNA and inhibits further translational
activity of the bound mRNA.

Amplification, Affinity Enrichment, and Screening
A basic method is described for synthesizing a 
nascent peptide-polysome library and nascent single-chain 
antibody-polysome library in vitro, screening and enrichment of
the library for species having desired specific receptor- 
binding or epitope-binding properties, and recovery of the 
nucleotide sequences that encode those peptides or antibodies 
of sufficient binding affinity for receptor or epitope (e.g., 
immobilized receptor or epitope) sufficient for selection by 
affinity selection (e.g., panning, affinity chromatography). 
Although the method is described with reference to nascent 
peptide libraries, the method is also applicable to 
synthesizing and screening nascent single-chain antibody 
libraries. 

The library consists of a population of nascent 
peptide library members comprising nascent peptides, with the 
[...] variable [...] sequence to its own encoding
mRNA or a cDNA copy thereof. These RNA-protein complexes (or
DNA-protein complexes) are screened for high affinity binding
to a particular receptor (e.g., a peptide hormone receptor).
After selecting those nascent peptide library members that bind 
to the ligand with high affinity, the selected complexes are 
disrupted and the mRNA (or DNA) is recovered and amplified to 
create DNA copies of the message; typically, each copy comprises
an operably linked in vitro transcription promoter (e.g., T7 or
SP6 promoter). The DNA copies are transcribed in vitro to
produce mRNA, and the process is repeated to enrich for 
peptides that bind with sufficient affinity. Unlike the other 
in vitro methods that rely on intact polysomes for screening, 
the present method's screening of desired peptides in vitro is
accomplished without the necessity of maintaining intact
polysomes. Thus, many of the problems inherent to
immunopurification of polysomes are avoided, and conditions
which disrupt intact polysomes may be used for screening 
conditions, if desired. 

The following general steps are frequently followed 
in the method: (1) generate a DNA template which is suitable 
for in vitro synthesis of mRNA, (2) synthesize mRNA in vitro by 
transcription of the DNA template(s) and add to an in vitro
translation system, (3) bind the nascent peptide tether to its 
own mRNA or a DNA primer which will hybridize to the encoding 
mRNA (and preferably prime cDNA synthesis of it), (4) screen 
the resultant nascent peptide library members for receptor- 
binding, (5) recover and amplify nascent peptide library 
members which bind the receptor and produce DNA templates from 
the selected library members competent for in vitro 
transcription.
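
The effect of repeating the general steps above can be illustrated with a simple bookkeeping model of iterative enrichment. The Python sketch below assumes that, in each round, a fixed fraction of true binders and a (much smaller) fixed fraction of non-binding background are recovered and then amplified without bias. All numerical values and the function name are hypothetical and chosen only to show the arithmetic; real recoveries vary with the target, the wash stringency, and the amplification step.

```python
def binder_fraction_after_rounds(
    initial_fraction: float,
    binder_recovery: float,
    background_recovery: float,
    rounds: int,
) -> float:
    """Fraction of the library that consists of binders after `rounds` cycles.

    Each cycle multiplies binders by `binder_recovery` and non-binders by
    `background_recovery`, then renormalizes (modeling unbiased amplification).
    """
    binders = initial_fraction
    others = 1.0 - initial_fraction
    for _ in range(rounds):
        binders *= binder_recovery
        others *= background_recovery
        total = binders + others
        binders, others = binders / total, others / total
    return binders


if __name__ == "__main__":
    # Hypothetical values: 1 binder per million members, 10% binder recovery,
    # 0.01% background carry-over per round.
    for n in range(4):
        f = binder_fraction_after_rounds(1e-6, 0.1, 1e-4, n)
        print(f"after {n} round(s): binder fraction = {f:.6f}")
```

With these assumed values each round improves the binder-to-background ratio about 1000-fold, so a rare binder can come to dominate the population within a few rounds.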

Each generated DNA template preferably contains a 
promoter (e.g., T7 or SP6) which is active in an in vitro 
transcription system. A DNA template generally comprises (1) a 
promoter which is functional for in vitro transcription and 
operably linked to (2) a polynucleotide sequence encoding an 
[...] amino- to carboxy-terminal), (2) a polynucleotide sequence to
which the tether can bind and/or to which a DNA primer suitable
for priming first-strand cDNA synthesis of the mRNA can bind, 
and (3) a ribosome-binding site and other elements necessary 
for in vitro translatability of the mRNA, and optionally, for 
mRNA stability and translatable secondary structure, if any.

In embodiments where the tether is a peptide which
binds to a particular RNA sequence (e.g., Tar or a
biotinylation sequence), the polynucleotide sequence to which
the tether binds is referred to as a "target site". 

The target site for the tether segment is frequently
near the 5' end of the mRNA non-coding sequence, but may be
located anywhere on the mRNA to facilitate binding to the
nascent peptide tether segment. If the target site is located
near the ribosome binding site, binding of the tether segment
will preferably prevent reinitiation of translation and thus
enhance the probability that only one protein per unit mRNA is
synthesized in the system. The DNA templates of the library
are transcribed, typically in vitro, to produce a population of
translatable mRNA molecules encoding distinct variable segment
sequences (i.e., a library). Frequently, the DNA templates
comprise a T7 or SP6 promoter operably linked to the sequence
encoding the tether and variable segments. The mRNA library
members produced as transcription products of the DNA templates
are then translated in vitro using an efficient in vitro
translation system (e.g., using an E. coli S30 coupled
transcription-translation system). The translation products
are fusion proteins and may be non-terminated translation
products (i.e., nascent peptides) attached to the encoding mRNA
via the translating ribosome.

The encoded fusion protein (nascent peptide)
generally comprises a tether segment fused to a variable
segment that is frequently one member of a random library of
peptide sequences from about 5 to 20 amino acids in length, but
may be longer or shorter as discussed (supra). The fusion
[...] necessary for optimal binding to the mRNA or
DNA. The tether segment and the variable segment may be
separated by a polypeptide spacer if desired; generally, such a
spacer is less than 500 amino acids. A single fusion protein
(nascent peptide) may comprise multiple tether segments and/or 
variable segments, and/or spacer segments. 

For tethered nascent peptides that bind the encoding 
mRNA, it is generally important that the nascent peptide fold 
properly and bind to its own mRNA before release from the 
ribosome, or shortly thereafter in dilute conditions. This may 
be accomplished by slowing or arresting the elongation cycle of 
translation by including at the 3' end of the mRNA a series of
rare codons or a hybridization sequence for an antisense primer
(e.g., DNA, RNA, PNA) and secondary structure sequences.

Following translation, polysomes are isolated and 
ribosomes released by the addition of EDTA sufficient to 
chelate the Mg2+ present in the buffer. Ribosomes are removed
by high-speed centrifugation, and the RNA/protein complexes are 
screened for high-affinity receptor-binding using standard 
procedures and as described herein. 

After selecting those nascent peptide/polynucleotide 
complexes that bind with sufficient affinity, the RNA component 
is released by phenol extraction, or by changing the ionic 
strength, temperature or pH of the binding buffer so as to 
denature the nascent peptide. A cDNA copy of the mRNA is made 
using reverse transcriptase, and the cDNA copy is amplified by
the polymerase chain reaction (PCR). The amplified cDNA is
added to the in vitro transcription system and the process is 
repeated to enrich for those peptides that bind with high 
affinity. 

Alternatively, where the nascent peptide is linked to 
a DNA primer, the primer is extended by reverse transcription 
to form nascent peptide/DNA complexes prior to affinity 
screening. The residual mRNA sequences, if any, may be removed 
(e.g., by RNAse H or base hydrolysis), if desired, prior to or 
after affinity screening. 

[...] single-chain antibodies produced and [...]
the method of the invention are selected to bind a
predetermined epitope. Typically, the predetermined epitope
will be selected in view of its applicability as a diagnostic
and/or therapeutic target. Several reports of the diagnostic 
and therapeutic utility of scFv have been published (Gruber et 
al. (1994) op.cit.; Lilley et al. (1994) op.cit.; Huston et al.
(1993) Int. Rev. Immunol. 10:195; Sandhu JS (1992) Crit. Rev.
Biotechnol. 12:437).

Such single-chain antibodies generally bind to a 
predetermined antigen (e.g., the immunogen) with an affinity of 
about at least 1 x 10^7 M^-1, preferably with an affinity of
about at least 5 x 10^7 M^-1, more preferably with an affinity of
at least 1 x 10^8 M^-1 to 1 x 10^9 M^-1 or more, sometimes up to 1 x
10^10 M^-1 or more. Frequently, the predetermined antigen is a
human protein, such as for example a human cell surface antigen
(e.g., CD4, CD8, IL-2 receptor, EGF receptor, PDGF receptor),
other human biological macromolecule (e.g., thrombomodulin,
protein C, carbohydrate antigen, sialyl Lewis antigen, L-
selectin), or nonhuman disease associated macromolecule (e.g.,
bacterial LPS, virion capsid protein or envelope glycoprotein) 
and the like. 
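
The affinities above are stated as association constants (units of M^-1). Since many binding assays report dissociation constants instead, the short Python sketch below converts between the two using Kd = 1 / Ka. It is offered only as a convenience illustration; the function name is hypothetical, and the example values are association constants of the orders of magnitude quoted in the text.

```python
def ka_to_kd_nanomolar(ka_per_molar: float) -> float:
    """Convert an association constant (M^-1) to a dissociation constant (nM)."""
    if ka_per_molar <= 0:
        raise ValueError("association constant must be positive")
    kd_molar = 1.0 / ka_per_molar
    return kd_molar * 1e9


if __name__ == "__main__":
    for ka in (1e7, 5e7, 1e10):
        print(f"Ka = {ka:.0e} M^-1  ->  Kd = {ka_to_kd_nanomolar(ka):g} nM")
```

For example, an association constant of 1 x 10^7 M^-1 corresponds to a dissociation constant of 100 nM, so a higher Ka means a lower Kd and tighter binding.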

High affinity single-chain antibodies of the desired
specificity can be engineered and expressed in a variety of
systems. For example, scFv have been produced in plants (Firek
et al. (1993) Plant Mol. Biol. 23:861) and can be readily made
in prokaryotic systems (Owens RJ and Young RJ (1994) J.
Immunol. Meth. 168:149; Johnson S and Bird RE (1991) Methods
Enzymol. 203:88). Furthermore, the single-chain antibodies
can be used as a basis for constructing whole antibodies or
various fragments thereof (Kettleborough et al. (1994) Eur. J.
Immunol. 24:952). The variable region encoding sequence may
be isolated (e.g., by PCR amplification or subcloning) and
spliced to a sequence encoding a desired human constant region 
to encode a human sequence antibody more suitable for human 
therapeutic uses where immunogenicity is preferably minimized. 

[...] expression vector in mammalian cells and purified for
pharmaceutical formulation.

The DNA expression constructs will typically include
an expression control DNA sequence operably linked to the 
coding sequences, including naturally-associated or 
heterologous promoter regions. Preferably, the expression 
5 control sequences will be eukaryotic promoter systems in 

vectors capable of transforming or transfecting eukaryotic host 
cells. Once the vector has been incorporated into the 
appropriate host, the host is maintained under conditions 
suitable for high level expression of the nucleotide sequences, 
10 and the collection and purification of the mutant "engineered" 



antibodies. 



As stated previously, the DNA sequences will be
expressed in hosts after the sequences have been operably
linked to an expression control sequence (i.e., positioned to
ensure the transcription and translation of the structural
gene). These expression vectors are typically replicable in
the host organisms either as episomes or as an integral part of
the host chromosomal DNA. Commonly, expression vectors will
contain selection markers, e.g., tetracycline or neomycin, to
permit detection of those cells transformed with the desired
DNA sequences (see, e.g., U.S. Patent 4,704,362, which is
incorporated herein by reference).

In addition to eukaryotic microorganisms such as
yeast, mammalian tissue cell culture may also be used to
produce the polypeptides of the present invention (see
Winnacker, "From Genes to Clones," VCH Publishers, N.Y., N.Y.
(1987), which is incorporated herein by reference). Eukaryotic
cells are actually preferred, because a number of suitable host
cell lines capable of secreting intact immunoglobulins have
been developed in the art, and include the CHO cell lines,
various COS cell lines, HeLa cells, myeloma cell lines, etc.,
but preferably transformed B-cells or hybridomas. Expression
vectors for these cells can include expression control
[...] necessary processing information sites, such as ribosome
binding sites, RNA splice sites, polyadenylation sites, and
transcriptional terminator sequences. Preferred expression
control sequences are promoters derived from immunoglobulin
genes, cytomegalovirus, SV40, Adenovirus, Bovine Papilloma
Virus, and the like.

Eukaryotic DNA transcription can be increased by
inserting an enhancer sequence into the vector. Enhancers are
cis-acting sequences of between 10 and 300 bp that increase
transcription by a promoter. Enhancers can effectively
increase transcription when either 5' or 3' to the
transcription unit. They are also effective if located within
an intron or within the coding sequence itself. Typically,
viral enhancers are used, including SV40 enhancers,
cytomegalovirus enhancers, polyoma enhancers, and adenovirus
enhancers. Enhancer sequences from mammalian systems are also
commonly used, such as the mouse immunoglobulin heavy chain
enhancer.

Mammalian expression vector systems will also
typically include a selectable marker gene. Examples of
suitable markers include the dihydrofolate reductase gene
(DHFR), the thymidine kinase gene (TK), or prokaryotic genes
conferring drug resistance. The first two marker genes prefer
the use of mutant cell lines that lack the ability to grow
without the addition of thymidine to the growth medium.
Transformed cells can then be identified by their ability to
grow on non-supplemented media. Examples of prokaryotic drug
resistance genes useful as markers include genes conferring
resistance to G418, mycophenolic acid and hygromycin.

The vectors containing the DNA segments of interest
can be transferred into the host cell by well-known methods,
depending on the type of cellular host. For example, calcium
chloride transfection is commonly utilized for prokaryotic
cells, whereas calcium phosphate treatment, lipofection, or
electroporation may be used for other cellular hosts. Other
methods used to transform mammalian cells include the use of
[...] Once expressed, the antibodies, individual mutated
immunoglobulin chains, mutated antibody fragments, and other
immunoglobulin polypeptides of the invention can be purified
according to standard procedures of the art, including ammonium
sulfate precipitation, fraction column chromatography, gel
electrophoresis and the like (see, generally, Scopes, R.,
Protein Purification, Springer-Verlag, N.Y. (1982)). Once
purified, partially or to homogeneity as desired, the
polypeptides may then be used therapeutically or in developing
and performing assay procedures, immunofluorescent stainings,
and the like (see, generally, Immunological Methods, Vols. I
and II, Eds. Lefkovits and Pernis, Academic Press, New York,
N.Y. (1979 and 1981)).

The antibodies of the present invention can be used 
for diagnosis and therapy. By way of illustration and not 
limitation, they can be used to treat cancer, autoimmune 
diseases, or viral infections. For treatment of cancer, the 
antibodies will typically bind to an antigen expressed 
preferentially on cancer cells, such as erbB-2, CEA, CD33, and 
many other antigens well known to those skilled in the art. 
For treatment of autoimmune disease, the antibodies will 
typically bind to an antigen expressed on T-cells, such as CD4, 
the IL-2 receptor, the various T-cell antigen receptors and 
many other antigens well known to those skilled in the art 
(e.g., see Fundamental Immunology, 2nd ed., W.E. Paul, ed.,
Raven Press: New York, NY, which is incorporated herein by
reference). For treatment of viral infections, the antibodies
will typically bind to an antigen expressed on cells infected
by a particular virus such as the various glycoproteins (e.g.,
gB, gD, gH) of herpes simplex virus and cytomegalovirus, and
many other antigens well known to those skilled in the art
(e.g., see Virology, 2nd ed., B.N. Fields et al., eds., (1990),
Raven Press: New York, NY).

Pharmaceutical compositions comprising antibodies of
the present invention are useful for parenteral administration,
i.e., subcutaneously, intramuscularly or intravenously. The
[...] dissolved in an acceptable carrier, preferably an aqueous
carrier. A variety of aqueous carriers can be used, e.g.,
water, buffered water, 0.4% saline, 0.3% glycine and the like.
These solutions are sterile and generally free of particulate
matter. These compositions may be sterilized by conventional,
well known sterilization techniques. The compositions may
contain pharmaceutically acceptable auxiliary substances as
required to approximate physiological conditions such as pH
adjusting and buffering agents, toxicity adjusting agents and
the like, for example sodium acetate, sodium chloride,
potassium chloride, calcium chloride, sodium lactate, etc. The
concentration of the mutant antibodies in these formulations
can vary widely, i.e., from less than about 0.01%, usually at
least about 0.1% to as much as 5% by weight and will be
selected primarily based on fluid volumes, viscosities, etc.,
in accordance with the particular mode of administration
selected.

Thus, a typical pharmaceutical composition for
intramuscular injection could be made up to contain 1 ml
sterile buffered water, and about 1 mg of mutant antibody. A
typical composition for intravenous infusion can be made up to
contain 250 ml of sterile Ringer's solution, and 10 mg of
mutant antibody. Actual methods for preparing parenterally
administrable compositions will be known or apparent to those
skilled in the art and are described in more detail in, for
example, Remington's Pharmaceutical Science, 15th Ed., Mack
Publishing Company, Easton, Pennsylvania (1980), which is
incorporated herein by reference.
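
By way of illustration only, the two typical compositions above
can be converted to percent concentrations. The sketch below
assumes the carrier has a density of about 1 g/ml, so that percent
weight/volume approximates percent by weight; the function name
percent_w_v is hypothetical and not part of the disclosure.

```python
def percent_w_v(mass_mg: float, volume_ml: float) -> float:
    """Concentration as percent weight/volume (grams per 100 ml)."""
    grams = mass_mg / 1000.0
    return grams / volume_ml * 100.0

# Values taken from the two typical compositions described in the text.
intramuscular = percent_w_v(mass_mg=1.0, volume_ml=1.0)
intravenous = percent_w_v(mass_mg=10.0, volume_ml=250.0)

print(f"intramuscular injection: {intramuscular:.3f}% w/v")
print(f"intravenous infusion:    {intravenous:.3f}% w/v")
```

Under that density assumption the intramuscular example falls at
about the "usually at least about 0.1%" level, while the
intravenous example falls in the "less than about 0.01%" range.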

CDR Diversification 

The present invention enables the generation of a
vast library of CDR-variant single-chain antibodies. One way
to generate such antibodies is to insert synthetic CDRs into
the single-chain antibody and/or CDR randomization. The
sequences of the synthetic CDR cassettes are selected by
referring to known sequence data of human CDR and are selected
[...] percent positional sequence identity to known CDR sequences,
and preferably will have at least 50 to 70 percent positional
sequence identity to known CDR sequences. For example, a
collection of synthetic CDR sequences can be generated by
synthesizing a collection of oligonucleotide sequences on the
basis of naturally-occurring human CDR sequences listed in
Kabat et al. (1991) op.cit.; the pool(s) of synthetic CDR
sequences are calculated to encode CDR peptide sequences having
at least 40 percent sequence identity to at least one known
naturally-occurring human CDR sequence. Alternatively, a
collection of naturally-occurring CDR sequences may be compared
to generate consensus sequences so that amino acids used at a
residue position frequently (i.e., in at least 5 percent of
known CDR sequences) are incorporated into the synthetic CDRs
at the corresponding position(s). Typically, several (e.g., 3
to about 50) known CDR sequences are compared and observed
natural sequence variations between the known CDRs are
tabulated, and a collection of oligonucleotides encoding CDR
peptide sequences encompassing all or most permutations of the
observed natural sequence variations is synthesized. For
example but not for limitation, if a collection of human VH CDR
sequences have carboxy-terminal amino acids which are either
Tyr, Val, Phe, or Asp, then the pool(s) of synthetic CDR
oligonucleotide sequences are designed to allow the
carboxy-terminal CDR residue to be any of these amino acids. In
some embodiments, residues other than those which naturally occur
at a residue position in the collection of CDR sequences are
incorporated: conservative amino acid substitutions are
frequently incorporated and up to 5 residue positions may be
varied to incorporate non-conservative amino acid substitutions
as compared to known naturally-occurring CDR sequences. In
general, the number of unique oligonucleotide sequences
included should not exceed the number of primary transformants
expected in the bacteriophage-display or polysome-display
library by more than about ten-fold. Construction of such
pools of defined and/or degenerate sequences will be readily
accomplished by those of ordinary skill in the art.
[...] occurring CDR sequence. It is within the discretion of the
practitioner to include or not include a portion of random or
pseudorandom sequence corresponding to N region addition in the
heavy chain CDR; the N region sequence ranges from 1 nucleotide
to about 4 nucleotides occurring at V-D and D-J junctions. A
collection of synthetic heavy chain CDR sequences comprises at
least about 100 unique CDR sequences, typically at least about
1,000 unique CDR sequences, preferably at least about 10,000
unique CDR sequences, frequently more than 50,000 unique CDR
sequences; however, usually not more than about 1 x 10^6 unique
CDR sequences are included in the collection, although
occasionally 1 x 10^7 to 1 x 10^8 unique CDR sequences are
present, especially if conservative amino acid substitutions
are permitted at positions where the conservative amino acid
substituent is not present or is rare (i.e., less than 0.1
percent) in that position in naturally-occurring human CDRs.
In general, the number of unique CDR sequences included in a
library should not exceed the expected number of primary
transformants in the library by more than a factor of 10.
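
By way of illustration and not limitation, the tabulation of
observed CDR variation and the ten-fold guideline on pool size
might be sketched as follows. The five aligned CDR strings are
hypothetical placeholders (they are not taken from Kabat et al.),
and the function names are assumptions introduced only for this
example.

```python
from collections import Counter
from math import prod

def allowed_residues(cdrs, min_fraction=0.05):
    """Per position, residues seen in at least min_fraction of aligned CDRs."""
    length = len(cdrs[0])
    if any(len(c) != length for c in cdrs):
        raise ValueError("CDR sequences must be aligned to equal length")
    allowed = []
    for pos in range(length):
        counts = Counter(c[pos] for c in cdrs)
        keep = sorted(r for r, n in counts.items()
                      if n / len(cdrs) >= min_fraction)
        allowed.append(keep)
    return allowed

def pool_size(allowed):
    """Number of peptide sequences covering all permutations of the variation."""
    return prod(len(residues) for residues in allowed)

def within_tenfold(n_unique, n_transformants):
    """Guideline from the text: unique sequences <= 10 x primary transformants."""
    return n_unique <= 10 * n_transformants

# Hypothetical aligned CDR sequences, for illustration only.
known_cdrs = ["GYTFTSY", "GFTFSSY", "GYSFTGY", "GFTFSDF", "GYTFTDY"]
allowed = allowed_residues(known_cdrs)
size = pool_size(allowed)

print(allowed)
print("pool size:", size)
print("fits a 10^9 transformant library:", within_tenfold(size, 10**9))
```

With only a handful of input sequences every observed residue
passes the 5 percent threshold; the threshold matters once several
dozen known CDRs are compared.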

The broad scope of this invention is best understood
with reference to the following examples, which are not
intended to limit the invention in any manner. The following
examples are offered by way of illustration, not by way of
limitation.

EXPERIMENTAL EXAMPLES

Oligonucleotide Sequences

(M represents A or C; K represents G or T; and N represents A,
C, T, or G)

ON1747:
5' d(AAATTTCCAACGCCCTGGGTACC(MNN)10GCTAGCCATATGTATATCTCCTTCTT) 3'

or in alternative notation:
5' d(AAATTTCCAACGCCCTGGGTACCMNNMNNMNNMNNMNNMNNMNNMNNMNNMNNGCTAGC
CATATGTATATCTCCTTCTT) 3'

ON3150: 5' d(ACCTGGGCCATGGCCGGCTGGGCCGCAT) 3'

ON3149: 5' d(TCTCCGGGAGCTGCATGTGTC) 3'

ON3147: 5' d(ATGCGGCCCAGCCGGCCATGGCCCAGGT) 3'

ON3148: 5' d(CAGTTTCTGCGGCCGCACGTTTGAT) 3'

ON3193: 5' d(ATCAAACGTGCGGCCGCAGAAACTGTTGAATTC) 3'

ON2970: 5' d(AATTGGAGGATCGTGCATGTGAC) 3'

ON1543: 5' d(ACTTCGAAATTAATACGACTCACTATAGGGAGACCACAACGGTTTCCCTC
TAGAAATAATTTTGTTTAACTTTAAGAAGGAGATATACAT) 3'
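
The degenerate (MNN)10 region of ON1747 can be examined with a
short script. The sketch below assumes, as its 3' complementarity
to ON1543 suggests, that ON1747 is the antisense strand, so that
each MNN triplet is read on the coding strand as its reverse
complement. It uses the standard genetic code; all names in the
code are hypothetical helpers, not part of the disclosure.

```python
from itertools import product

# Nucleotide ambiguity codes used in the oligonucleotide listing.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "M": "AC", "K": "GT", "N": "ACGT"}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A",
              "M": "K", "K": "M", "N": "N"}

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}

def reverse_complement(seq):
    return "".join(COMPLEMENT[base] for base in reversed(seq))

def expand(degenerate):
    """All concrete sequences matched by a degenerate sequence."""
    return ["".join(p) for p in product(*(IUPAC[b] for b in degenerate))]

sense_codon = reverse_complement("MNN")      # coding-strand form of MNN
codons = expand(sense_codon)
translated = {codon: CODON_TABLE[codon] for codon in codons}
stops = sorted(c for c, aa in translated.items() if aa == "*")
amino_acids = {aa for aa in translated.values() if aa != "*"}

print(sense_codon, len(codons), stops, len(amino_acids))
```

Under these assumptions each randomized position offers 32 codons,
of which 31 are sense codons covering all 20 amino acids. This is
consistent with the figure of 8.2 x 10^14 decacodon sequences
given in Example 1, which equals 31^10.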

EXAMPLE 1 

Overview 

A DNA library encoding approximately 10^12 different
decapeptide sequences was synthesized and incubated in an
Escherichia coli S30 coupled transcription/translation system.
Polysomes were isolated by centrifugation and added to
microtiter wells containing an immobilized monoclonal antibody
specific for the peptide dynorphin B as a model receptor
[...] cDNA and amplified by the polymerase chain reaction (PCR) to
produce template for the next round of in vitro synthesis and
selection. A portion of the amplified template pool following
each round was cloned and the random region sequenced. After
four rounds of affinity selection, the majority of clones
contained a consensus sequence that was similar to the known
high-affinity epitope for the antibody. Peptides corresponding
to several of these sequences were synthesized and found to
have binding affinities ranging from 7 to 140 nM. The in vitro
polysome system described here is capable of screening peptide
libraries that are three to six orders of magnitude larger than
current biological peptide expression systems.

[...] diversity is the transformation frequency of the bacterial
host, which for E. coli is between 10^7 and 10^9 total
transformants. Depending on the length of the peptide, this may
result in a small fraction of the total combinatorial
possibilities that can be screened. For example, the number of
possible peptide sequences for a ten residue peptide is 20^10, or
1.0 x 10^13, and the number of possible decacodon sequences (i.e.,
encoding nucleotide sequences) is 8.2 x 10^14. Thus, for a library
of 10^9 independent transformants, only a small fraction (0.01%) of
the possible sequences typically can be screened for binding.
In addition, other factors such as proteolysis and defective
secretion could potentially affect the diversity of peptide
sequences that are expressed in vivo.
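
The figures in the preceding paragraph can be reproduced with a
few lines of arithmetic. The sketch below assumes 31 sense codons
per randomized position (an assumption that reproduces the stated
8.2 x 10^14) and treats every library member as a distinct
sequence, so the fractions are upper bounds; the function name is
hypothetical.

```python
PEPTIDE_LENGTH = 10
AMINO_ACIDS = 20
SENSE_CODONS = 31   # assumption: 32 codons per position minus one stop

peptide_space = AMINO_ACIDS ** PEPTIDE_LENGTH
codon_space = SENSE_CODONS ** PEPTIDE_LENGTH

def fraction_sampled(library_size, space):
    """Upper bound on the fraction of a sequence space a library can cover."""
    return min(1.0, library_size / space)

print(f"decapeptide sequences: {peptide_space:.2e}")
print(f"decacodon sequences:   {codon_space:.2e}")
for name, size in (("transformed library", 10**9),
                   ("polysome library", 10**12)):
    frac = fraction_sampled(size, peptide_space)
    print(f"{name}: at most {frac:.4%} of decapeptides")
```

On these assumptions a 10^9-member library covers about 0.01% of
the decapeptide space, whereas a 10^12-member library could cover
up to roughly 10%.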

To create a recombinant peptide library that was not
limited by the transformation frequency of cells, an in vitro
polysome system was developed (described infra). A monoclonal
antibody (mAb) (D32.39) which binds dynorphin B, a 13-residue
opioid peptide, was selected as a model receptor. Previous
studies had shown that a six amino acid fragment of dynorphin
B, Arg-Gln-Phe-Lys-Val-Val (RQFKVV), defines the linear epitope
for the D32.39 mAb. A polysome library was generated
containing 10^12 random decapeptide (decacodon) sequences and
[...] vitro synthesis and selection. After just four rounds of
selection, the majority of peptides contained within the pool
shared a consensus sequence which was similar to the epitope
sequence. All of these peptides bound specifically and with
high affinity to the antibody.

Construction of a Synthetic Gene for Expression of Nascent
Peptides in vitro.

A gene for expressing nascent peptides in vitro was
constructed. The E. coli S30 system was used for in vitro
expression of the construct genes; it translates mRNA with
high efficiency, is well-characterized, and is a coupled system
that supports both transcription and translation.

A synthetic gene for expressing N-terminal peptides
under the transcriptional control of the bacteriophage T7
promoter was constructed. Oligonucleotide cassettes were
synthesized and ligated to unique restriction sites of the T7
expression plasmid, pT7-7 (Fig. 1). Cassettes encoding the
D32.39 epitope sequence (MARQFKVVT, epitope sequence RQFKVV)
or a scrambled, non-binding control sequence
(MAVFKRTVQ) were ligated in-frame to a repeating Gly-Ser
coding region (Fig. 1). When these plasmids are linearized
with HindIII prior to in vitro synthesis, the predicted gene
product is a protein of 93 residues with either the epitope or
control sequences beginning at amino acid position 3 (Fig. 1).
There are no stop codons in any of the three possible reading
frames. Fig. 3, panels (a) and (b), shows construction of a
DNA library containing a random population of decacodon
sequences. The degenerate region was constructed by annealing
100 pmoles of oligonucleotides ON1543 (containing the T7
promoter, PT7) and ON1747 and extending in a reaction
containing 104 units Sequenase (United States Biochemical), 1 mM
dNTP, and 10 mM DTT for 30 minutes at 37°C. The extended
product was cleaved with BstXI, ethanol precipitated, and
resuspended in water. The BstXI fragment containing the
Gly-Ser coding region shown on the right (Fig. 1) was prepared by
[...] the linkers between the HindIII/ClaI sites and the NdeI/EcoRI
sites of pLM142. Approximately 4 µg of the Gly-Ser fragment
was ligated to an equivalent amount of the degenerate region in
a reaction containing 400 units T4 ligase, 50 mM Tris, pH 8.0,
10 mM DTT, 1 mM ATP, and 25 µg/ml BSA for 16 hours at 15°C.
The 411 bp ligated product (Mr, 267 kDa or 2.5 x 10^12
molecules/µg) was gel purified and ligated.
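
The relationship between fragment length, molecular mass, and
number of template molecules can be checked numerically. The
sketch below takes the product length as 411 bp, which is the
length consistent with the stated Mr of 267 kDa, and assumes an
average mass of about 650 Da per base pair; both the constant and
the function names are assumptions made for illustration.

```python
AVOGADRO = 6.022e23        # molecules per mole
DALTONS_PER_BP = 650.0     # assumed average mass of one base pair

def molecular_mass_da(length_bp):
    """Approximate mass of a double-stranded DNA fragment in daltons."""
    return length_bp * DALTONS_PER_BP

def molecules_per_microgram(length_bp):
    """Number of fragment molecules in 1 microgram of DNA."""
    return AVOGADRO / molecular_mass_da(length_bp) * 1e-6

def molecules_in(mass_ng, length_bp):
    """Number of fragment molecules in a given mass (nanograms)."""
    return molecules_per_microgram(length_bp) * mass_ng / 1000.0

mass = molecular_mass_da(411)
per_ug = molecules_per_microgram(411)
in_440_ng = molecules_in(440, 411)

print(f"Mr: {mass / 1000:.0f} kDa")
print(f"molecules per microgram: {per_ug:.2e}")
print(f"molecules in 440 ng:     {in_440_ng:.2e}")
```

On these assumptions 440 ng of library, the amount used to program
the screening reaction described below, corresponds to roughly
10^12 template molecules.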

Alternatively, the 89 bp region flanked by BamHI and
SalI restriction sites (Fig. 1) was replaced with a 155 bp
segment from the gene pIII sequence and the resulting spacer
sequence was shown to be a superior template for PCR
amplification. Plasmid pLM182 appears to contain a Gly-Ser
region that is a better template for PCR amplification than that
of pLM145, which is shown in Fig. 3. Plasmid pLM182 is identical
to pLM145 except that the BamHI/SalI fragment shown in Fig. 1 is
replaced with the 155 bp sequence from pIII. The BstXI sites of
both plasmids are encoded by linkers that were cloned between the
NdeI/EcoRI sites and HindIII/ClaI sites (see Fig. 1). The
sequence of the NdeI/EcoRI linker is:
CATATG GGTAC CCAGGGCGTTGG T GAATTC (NdeI, BstXI, and EcoRI sites
are underlined). The 155 bp polynucleotide sequence is shown
below with the BamHI and SalI sites underlined:

GGATCCCAGTCGGTTGAATGTCGCCCTTATGTCTTTGGCGCTGGTAAACCATATGAATT 
TTCTATTGATTGTGACAAAATAAACTTATTCCGTGGTGTCTTTGCGTTCTTTTATATGT 
TGCCACCTTTATGTATGTATTTTCGACGTTTCGACGTTTGCTAACATACTGTCGAC 
To generate this fragment by PCR, we used ON2453
(5'-TATGGGTACCCAGGGCGTTGGTG-3') as the 5' primer, which overlaps
the BstXI site, and ON1230 (5'-GGCGCCTGCTGCCTGCGTGTCGCCTGTCGT-3')
as the 3' primer, which hybridizes to a region between the SalI
and HindIII sites as shown in Fig. 1.
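
The positions of the restriction sites in the NdeI/EcoRI linker,
and the overlap of primer ON2453 with the BstXI site, can be
confirmed with a simple pattern search. The recognition sequences
used below (NdeI CATATG, BstXI CCANNNNNNTGG, EcoRI GAATTC) are the
commonly cited ones; the helper names are hypothetical.

```python
import re

# Recognition sequences written as regular expressions.
SITES = {
    "NdeI": "CATATG",
    "BstXI": "CCA[ACGT]{6}TGG",
    "EcoRI": "GAATTC",
}

def find_sites(seq, sites=SITES):
    """Map each enzyme name to a list of (start index, matched site)."""
    return {name: [(m.start(), m.group()) for m in re.finditer(pattern, seq)]
            for name, pattern in sites.items()}

linker = "CATATG GGTAC CCAGGGCGTTGG T GAATTC".replace(" ", "")
primer_on2453 = "TATGGGTACCCAGGGCGTTGGTG"

hits = find_sites(linker)
primer_start = linker.find(primer_on2453)
primer_end = primer_start + len(primer_on2453)
bstxi_start, bstxi_site = hits["BstXI"][0]
primer_covers_bstxi = (primer_start <= bstxi_start
                       and bstxi_start + len(bstxi_site) <= primer_end)

print(hits)
print("ON2453 spans the BstXI site:", primer_covers_bstxi)
```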

Efficient spacer sequences are provided by various
means. For example, the Gly-Ser segment can be mutagenized
(e.g., randomly or pseudorandomly) to generate mutagenized
sequence between the EcoRI and BamHI sites. In one aspect, a
population of library members having a displayed peptide (or
[...] binding to the target macromolecule and library members bound
to the target are isolated. The population of isolated library
members is enriched for those library members having enhanced
binding affinity for the target, and is enriched for spacer
sequences which are compatible with the higher binding affinity
and/or which are efficiently amplified by Taq polymerase (or
other PCR-compatible polymerase) and/or because a sequence in
the mutagenized portion was efficient in stalling ribosomes
(perhaps resulting in multivalent display) and causing an
increase in recovery of the polysomal mRNA. Generally, after
one or more rounds of such affinity selection, the sequence(s)
of the selected spacer sequence(s) is/are determined.

In Vitro Synthesis and Isolation of Polysomes.

The E. coli S30 extract (Promega) was prepared from
the B strain SL119 as described (Zubay G (1973) Ann. Rev.
Genet. 7: 267). Synthesis reactions were contained in a final
volume of 50 µl and included 20 µl of complete premix or premix
lacking methionine for radiolabeling protein (Lesley et al.
(1991) J. Biol. Chem. 266: 2632), 15 µl of extract, 1 µl of
rifampicin (1 mg/ml), 100 units of T7 RNA polymerase (Ambion),
20 units of RNasin (Promega) and DNA as indicated. Reactions
were incubated for 30 min at 37°C and synthesis was stopped by
placing on ice and diluting four-fold with polysome buffer (20
mM Hepes-OH pH 7.5/10 mM MgCl2/1.5 µg/ml chloramphenicol/100
µg/ml acetylated bovine serum albumin (BSA)/1 mM dithiothreitol
(DTT)/20 units/ml RNasin/0.1% Triton X-100). An alternative
polysome buffer comprises 10 mM sodium phosphate, pH 7.4, 5 mM
MgCl2, 1 mM DTT, 0.85% Tween, 1.5 µg/ml chloramphenicol, 0.1%
BSA, and 20 units/ml RNasin. To radiolabel mRNA or protein, 5
µCi of α-[32P]UTP (Amersham, 3000 Ci/mmole) or [35S]methionine
(Amersham, 617 Ci/mmole) was included in the reaction and the
incorporation of label was quantitated by precipitating
duplicate samples with trichloroacetic acid (TCA), counting in
a liquid scintillation counter and averaging the values. To
isolate polysomes, the diluted reactions were centrifuged at
[...] for 5 min to remove any insoluble material. To measure the
incorporation of mRNA into polysomes, equal amounts of
32P-labelled mRNA from a reaction were diluted in polysome buffer
or elution buffer (polysome buffer plus 20 mM EDTA) and
centrifuged as described above. The fraction of total mRNA
which was specifically released from polysomes by EDTA was
determined by TCA precipitation. Fig. 5 shows the effect of DNA
library concentration on protein synthesis in vitro.
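
One way the EDTA-specific fraction might be computed from duplicate
TCA-precipitable counts is sketched below. The text does not give
the calculation, so the formula is an assumption: counts pelleted
in polysome buffer, less counts still pelleted in the presence of
EDTA, divided by total counts. All numbers are hypothetical.

```python
from statistics import mean

def edta_released_fraction(pellet_cpm_buffer, pellet_cpm_edta, total_cpm):
    """Fraction of total mRNA pelleted with polysomes and released by EDTA.

    Each argument is a list of duplicate counts (cpm); duplicates are
    averaged, as described for the TCA precipitations in the text.
    """
    pelleted = mean(pellet_cpm_buffer)
    edta_background = mean(pellet_cpm_edta)
    total = mean(total_cpm)
    if total <= 0:
        raise ValueError("total counts must be positive")
    return max(0.0, (pelleted - edta_background) / total)

# Hypothetical duplicate counts, for illustration only.
fraction = edta_released_fraction(
    pellet_cpm_buffer=[42000, 44000],
    pellet_cpm_edta=[7000, 9000],
    total_cpm=[100000, 100000],
)
print(f"EDTA-specific fraction of mRNA in polysomes: {fraction:.0%}")
```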






Affinity selection of polysomes.

Dynal beads were prepared according to the
manufacturer's instructions. Microtiter wells were prepared for
polysome
[...] in PBS (10 mM sodium phosphate pH 7.4/120 mM NaCl/2.7 mM KCl)
for 1 hr at 37°C, washing with PBS, blocking with PBS/1%
nonfat milk for 1 hr at 37°C and washing again with polysome
buffer. Polysomes, as indicated, were incubated with the
antibody for 2 hr at 4°C. Each well was washed five times
with 100 µl of polysome buffer and the mRNA was recovered in
100 µl of elution buffer after incubating for 30 min at 4°C.



Specific Binding of Polysomes to mAb D32.39. The fraction of
polysomes capable of binding specifically to mAb D32.39 via the
nascent peptide was determined. Plasmids encoding the epitope
(pLM138) or control sequences (pLM142) were linearized with
HindIII, and incubated in separate S30 reactions containing
α-[32P]UTP to label the newly-synthesized mRNA. Translation
elongation was stopped by adding chloramphenicol and the
reactions were centrifuged at high speed to pellet polysomes
and free ribosomal subunits. Radiolabeled polysomes
containing the epitope or control coding sequences were added
to separate microtiter wells containing the immobilized D32.39
mAb. Following binding and washing to remove unbound
polysomes, EDTA was added to dissociate the complexes and the
labelled mRNA was recovered. The amount of mRNA recovered from
[...] binding was blocked by the prior addition of free dynorphin B
peptide to the wells (Fig. 2B). Binding of polysomes to mAb
D32.39 is peptide-specific.

The binding study demonstrates that 1-2% of polysomal
mRNA encoding the epitope is recovered from the antibody. This
low recovery is not caused by inefficient release of mRNA from
the antibody since equal amounts of mRNA were recovered with
phenol extraction or EDTA addition. The possibility that poor
binding is caused by inefficient capture of polysomes by D32.39
immobilized in the microtiter well was evaluated. Unbound
polysomes were removed from the microtiter well following the
binding step and added to a fresh well containing immobilized
D32.39. This was repeated with identical conditions for a
third well. From all three wells, approximately the same
percentage of input polysomal mRNA (1%) was recovered. Thus,
at least 3%, and probably a much greater percentage, of
polysomes containing the epitope are capable of binding the
[...] maximum percentage of binding was not determined. Alternative
immobilization matrices such as beads or mini columns for
improving the efficiency of polysome capture can be used.
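
The reasoning behind the "at least 3%" estimate can be made
explicit. The sketch below sums the serial recoveries to obtain a
lower bound, and then uses a simple model, which is an assumption
and not part of the disclosure, in which a fraction c of polysomes
is binding-competent and each well captures a fraction e of the
competent polysomes still in solution. The values of c and e are
hypothetical.

```python
def lower_bound_competent(recoveries):
    """Sum of serial recoveries: a lower bound on the competent fraction."""
    return sum(recoveries)

def predicted_recoveries(competent, capture_efficiency, wells):
    """Recovery per well (fraction of input) under the serial-capture model."""
    remaining = competent
    out = []
    for _ in range(wells):
        captured = remaining * capture_efficiency
        out.append(captured)
        remaining -= captured
    return out

observed = [0.01, 0.01, 0.01]          # about 1% of input from each well
bound = lower_bound_competent(observed)

# Hypothetical parameters chosen to give about 1% in the first well.
model = predicted_recoveries(competent=0.5, capture_efficiency=0.02, wells=3)

print(f"lower bound on competent fraction: {bound:.0%}")
print("model recoveries:", [f"{r:.2%}" for r in model])
```

In this model, nearly constant recoveries across successive wells
arise when the per-well capture efficiency is low, which is why the
competent fraction may be much greater than the 3% lower bound.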

Screening of a polysome library.

Polysomes were isolated from a reaction programmed
with 440 ng of DNA library and equal portions were added to six
microtiter wells containing the immobilized mAb D32.39.
Following affinity selection, the recovered mRNA samples were
combined and treated with 6 units of DNase I (Ambion) for 15
min at 37°C after raising the MgCl2 concentration to 40 mM.
The mRNA was phenol extracted, ethanol precipitated in the
presence of glycogen and the pellet was resuspended in 20 µl of
RNase-free water. A portion of the mRNA (8.5 µl) was heated
for 3 min at 80°C, chilled on ice and 50 pmoles of primer
ON1914 (5' GATTGTGGAAGCTTGGCGCCTGCT 3') were added to
synthesize cDNA using the AMV reverse transcription system
(Promega). The cDNA was amplified by PCR in a reaction
[...] of Taq polymerase (Promega), and 0.5 µM each of primer ON1415
containing the T7 promoter (5'
ACTTCGAAATTAATACGACTCACTATAGGGAGACCACAACGGTTTCCCTCT 3') and
primer ON1230 (5' GGCGCCTGCTGCCTGCGTGTCGCCTGTCGT 3').

Amplification consisted of 30 cycles of denaturation at 95°C
for 45 sec, and annealing/extension at 72°C for 1 min. The
amplified product was gel purified and quantitated by measuring
the A260.
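
Quantitation from the A260 might be carried out as sketched below.
The conversion factor of 50 µg/ml per absorbance unit for
double-stranded DNA in a 1 cm path is the commonly used
approximation; the absorbance reading and dilution factor are
hypothetical, and the function names are introduced only for this
example.

```python
DSDNA_UG_PER_ML_PER_A260 = 50.0   # common approximation, 1 cm path length

def dsdna_concentration(a260, dilution_factor=1.0, path_cm=1.0):
    """Double-stranded DNA concentration in ug/ml (equivalently ng/ul)."""
    return a260 / path_cm * DSDNA_UG_PER_ML_PER_A260 * dilution_factor

def volume_for_mass_ul(mass_ng, conc_ng_per_ul):
    """Volume in microliters that contains the requested mass of DNA."""
    return mass_ng / conc_ng_per_ul

# Hypothetical reading: A260 of 0.22 measured on a 10-fold dilution.
conc = dsdna_concentration(a260=0.22, dilution_factor=10)
volume = volume_for_mass_ul(mass_ng=440, conc_ng_per_ul=conc)

print(f"template concentration: {conc:.0f} ng/ul")
print(f"volume containing 440 ng: {volume:.1f} ul")
```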

DNA sequencing.

Subcloning of the DNA pool to the phagemid vector,
pAFF6, for sequencing and ELISA is shown schematically in Fig.
4. Single-stranded phagemid DNA was isolated [...]
(Biorad) and the random region was sequenced using the
Sequenase system (United States Biochemical).

ELISA.

To measure phage binding by ELISA, the microtiter
wells were prepared as described above except that 1 µg of mAb
D32.39 per well was used and the blocking buffer consisted of
PBS/1% BSA. Duplicate portions of phage supernatant (50 µl)
were added to wells and incubated for 2 hr at 4°C. Wells were
washed with 50 volumes of PBS, and 100 µl of horseradish
peroxidase conjugated to sheep anti-M13 IgG (1:2000 dilution,
Pharmacia) was added and incubated for 1 hr at 4°C. Wells were
washed with PBS and binding was detected by adding substrate
(0.2 mg/ml 2',2'-azino-bis(3-ethylbenzthiazoline-6-sulphonic
acid) diammonium/50 mM citric acid pH 4/0.05% hydrogen
peroxide) and measuring the A405. The positive and negative
controls were phage expressing the D32.39 epitope and control
peptides, respectively. Phage clones were scored as positive
if the average A405 value was at least two-fold greater than
that obtained for binding to wells not coated with the mAb or
to wells preincubated with 10 µM dynorphin B peptide prior to
adding phage.
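
The scoring rule can be written as a small function. In this
sketch the caller supplies duplicate A405 readings for the clone
and for whichever background control is used (uncoated wells or
wells preincubated with dynorphin B); the readings shown are
hypothetical.

```python
from statistics import mean

def is_positive(sample_a405, background_a405, fold=2.0):
    """True if mean sample A405 is at least `fold` times mean background."""
    return mean(sample_a405) >= fold * mean(background_a405)

# Hypothetical duplicate readings, for illustration only.
binder = is_positive(sample_a405=[0.82, 0.78], background_a405=[0.10, 0.12])
non_binder = is_positive(sample_a405=[0.20, 0.18], background_a405=[0.10, 0.12])

print("clone A positive:", binder)
print("clone B positive:", non_binder)
```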

Peptides were synthesized on an Applied Biosystems
model 431A peptide synthesizer using Fmoc-protected amino
acids. The peptides were purified to greater than 90% purity
by HPLC and confirmed by mass spectroscopy. The competition
binding assay was performed and included a low concentration
(50 pM) of the tracer peptide containing the D32.39 epitope
sequence.



RESULTS 

Screening of a Polysome Library.

Polysomes expressing a library of peptides were
screened for binding to the D32.39 mAb. An in vitro system was
programmed with DNA containing 10^12 different decacodons,
incubated, and polysomes were isolated and added to microtiter
wells containing the immobilized mAb D32.39. Following
affinity selection, the bound mRNA was recovered and copied to
cDNA using reverse transcriptase and amplified by PCR using
primers that included the sequences for the promoter and leader
regions of T7 RNA polymerase. A portion of the amplified DNA
product was then added to the S30 system for a subsequent round
of in vitro synthesis and affinity selection.

After each round of selection, a portion of the 
amplified DNA template was subcloned to pAFF6 and the random 
region was sequenced. The sequences of selected clones 
isolated from rounds 2, 3, 4, and 5 all bear similarity to the 
known six-residue epitope and related sequences identified in 
previous studies (Fig. 6). The most highly-conserved residues
are an invariant arginine at position one and phenylalanine at 
position three. The majority of the clones (52%) contain the 
positively charged residues lysine, arginine or histidine at 
position 4. The aliphatic residues valine, isoleucine, leucine 
and alanine are the most frequent group of amino acids found at 
positions 5 (76%) and 6 (71%) with valine the preferred 
residue. No strong bias was evident for residues in the second 
position. 

The binding specificity of these peptides for the
[...] terminal sequence of the processed recombinant pIII is
identical to the polysome-derived sequence for the first 14
residues. Each of the 21 unique peptide clones was tested for
binding to D32.39 using a phage ELISA. All of the clones were
positive in the ELISA test except for sequences HNEGIRMFRVV,
GMYETRLFHVG and FSERRFSVCW (epitope-like sequence underlined).
These three contain 29 nucleotides in the random nucleotide
region instead of 30, resulting in a frameshift mutation of the
pIII fusion; the frameshifts are an artifact of subsequent
cloning manipulations.

Binding Affinities of Enriched Peptides for mAb D32.39.

Peptides corresponding to some of the enriched
sequences were chemically synthesized, purified and their
identity confirmed by mass spectrometry. A competition binding
assay was used to estimate their affinity for the mAb D32.39
and under the conditions of the assay, the IC50 value should
approximate the Kd. Six peptides were assayed and the binding
affinities range from 7.2 to 140 nM (Fig. 6). For comparison,
the authentic dynorphin B peptide had an IC50 of 0.29 nM in
this assay.
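
Why the IC50 should approximate the Kd at a low tracer
concentration can be illustrated with the Cheng-Prusoff relation,
Ki = IC50 / (1 + [tracer]/Kd,tracer), a standard correction for
competition assays that the text itself does not name. The tracer
concentration of 50 pM is from the text; the tracer Kd of 0.29 nM
is a hypothetical value, chosen on the assumption that the tracer
binds about as tightly as dynorphin B.

```python
def cheng_prusoff_ki(ic50_nm, tracer_nm, tracer_kd_nm):
    """Inhibition constant from IC50 under the Cheng-Prusoff relation."""
    return ic50_nm / (1.0 + tracer_nm / tracer_kd_nm)

TRACER_NM = 0.05       # 50 pM tracer, as stated in the text
TRACER_KD_NM = 0.29    # hypothetical tracer Kd (assumption)

for ic50 in (7.2, 140.0):
    ki = cheng_prusoff_ki(ic50, TRACER_NM, TRACER_KD_NM)
    print(f"IC50 {ic50:6.1f} nM -> Ki about {ki:6.1f} nM")

ki_low = cheng_prusoff_ki(7.2, TRACER_NM, TRACER_KD_NM)
correction = 7.2 / ki_low
```

Under these assumptions the correction factor is below 1.2, so the
measured IC50 values would overstate the Kd by less than 20%.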

The immense size of the polysome library reported
here, 10^12 members, is a direct result of the complete in vitro
synthesis of DNA template, mRNA, and nascent peptide. By
avoiding bacterial transformation, the typical size of a
conventional, recombinant library (10^7-10^9 members) is exceeded
by several orders of magnitude. For certain random peptides
such as octapeptides or nonapeptides, comprising 2.6 x 10^10 and
5.1 x 10^11 possible sequences, respectively, screening by
polysomes may be the only currently available system for
sampling the complete repertoire of combinatorial
possibilities, or at least a substantial portion thereof. With
appropriate modifications, it is possible to further increase
the size of the library by increasing the translational
capacity of the cell-free system. Such modifications include
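
The sequence counts quoted above are 20^8 and 20^9. The following sketch reproduces them and gives a rough estimate of how much of each sequence space a library of a given size could sample. The coverage formula assumes every library member is an independent, uniform draw from the full sequence space, which is an idealization introduced here (real codon schemes are biased and include stop codons); it is not a calculation from the text.

```python
import math

AMINO_ACIDS = 20  # the 20 standard amino acids


def sequence_space(length):
    """Number of distinct peptides of the given length."""
    return AMINO_ACIDS ** length


def expected_coverage(library_size, length):
    """Expected fraction of all sequences present at least once,
    assuming independent, uniform sampling (Poisson approximation)."""
    return 1.0 - math.exp(-library_size / sequence_space(length))


for n in (8, 9):
    print(
        f"{n}-mers: {sequence_space(n):.1e} sequences; "
        f"coverage at 1e9 members = {expected_coverage(1e9, n):.3f}, "
        f"at 1e12 members = {expected_coverage(1e12, n):.3f}"
    )
```

Under this simplified model a 10^9-member library samples only a few percent of the octapeptide space, whereas a 10^12-member library samples most of the nonapeptide space.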
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In addition to larger libraries, the potential 
diversity of peptides expressed in vitro is also greater than in
conventional systems. Many cellular processes which limit in
vivo expression, such as defective secretion and proteolysis, are
absent or diminished in a cell-free system. Further diversity
is possible by including additional building blocks and
incorporating non-naturally occurring amino acids into peptides
using methods already established for the E. coli S30 system.
Finally, diversity is not affected by the translational reading
frame of the N-terminal nascent peptide. Three of the enriched
sequences (HNEGIRMFRVV, GMYETRLFHVG, FSERRFSVCW) contain 29
nucleotides in the random region instead of 30, resulting in a
frameshift of the downstream coding region. We confirmed that
one of these peptides (FSERRFSVCW) is capable of binding to the
mAb D32.39 with high affinity (110 nM). Thus, it is possible
to enrich for peptide sequences of varying lengths despite
changes in the reading frame of the synthetic genes which were
constructed.
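
The frameshift follows directly from the length of the random region: 30 nucleotides is a whole number of codons, 29 is not. A minimal sketch of that arithmetic is given below; the function name is illustrative only.

```python
CODON = 3  # nucleotides per codon


def frame_shift(designed_nt, observed_nt):
    """Reading-frame shift (0, 1 or 2) seen by the downstream coding
    region when the random region differs in length from the design."""
    return (observed_nt - designed_nt) % CODON


print(frame_shift(30, 30))  # designed length: downstream region stays in frame
print(frame_shift(30, 29))  # one nucleotide missing: downstream frame is shifted
```

A result of 0 means the downstream coding region is read in its designed frame; any other value means it is not, while the peptide encoded upstream of the change is unaffected.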

All of the peptides isolated by the polysome system
bound to the D32.39 mAb with high affinity (7-140 nM), despite
the existence of low affinity peptides for this mAb. One
possible explanation for this is that polysome display is
monovalent and only one initiation event occurs per mRNA
molecule. This may explain why certain peptides, such as clone
505 (PIMRSFKWL), which had the highest affinity of the peptides
tested (7 nM), were overrepresented in clones sequenced from the
later rounds of enrichment. Selective enrichment of high
affinity peptides synthesized in vitro has important
consequences. It is possible to include mutagenesis with each
round of template amplification and achieve directed evolution
of peptide ligands in a manner similar to that applied to
ribozymes.

The in vitro polysome system can also be used for
studying the role of mRNA sequence on translational pausing.
The antibiotic chloramphenicol was used to arrest translation
[...]
containing efficient pausing sequences.
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EXAMPLE 2 

Screening Single-Chain Antibodies with Nascent Polysome Method 

In this example, nascent single-chain antibodies on
polysomes are constructed and expressed in a polysome system.
The displayed scFv fragments exhibit binding specificity and
affinity for antigen. Compared to bacteriophage antibody-display
systems, the present polysome scFv display technology
enables the construction and screening of libraries that are
about 3 to 6 orders of magnitude larger than those of current
antibody display techniques in the art. Furthermore, many problems
associated with in vivo prokaryotic display systems (e.g.,
proteolysis, insoluble inclusion bodies, defective secretion)
are avoided.



Construction of Plasmids Encoding scFv for DT or Antibody 179

Two single-chain antibody genes (scFv) specific for
diphtheria toxin (DT) and antibody 179 were isolated from human
spleen and mouse hybridoma, respectively, using the Pharmacia
Recombinant Phage Antibody System (Pharmacia Biotech, Alameda,
CA). The antibody genes are carried by the plasmid vector
pCANTAB 5 E (Pharmacia) and are flanked by unique SfiI/NotI
restriction sites. Each antibody coding sequence is also fused
at the carboxy terminus to a 13-amino acid E-tag epitope
sequence. To measure the specificity of antibody binding by
ELISA, a 2 kb SfiI/EcoRI fragment from the pCANTAB 5 E clone
carrying the DT antibody gene was ligated to the same sites of
a derivative of pLM139 resulting in plasmid pLM169. This
plasmid contains the DT antibody gene under the transcriptional
control of the bacteriophage T7 promoter. Plasmid pLM169 was
linearized with EcoRI prior to adding to the in vitro
transcription/translation system. To measure binding of
antibodies displayed on polysomes, the 750 bp SfiI/NotI
fragments from the pCANTAB 5 E clones carrying the DT antibody
[...] respectively. Both plasmids were [...] prior
to adding to the in vitro system. Fig. 7 schematically
portrays the plasmid constructs.
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In vitro expression of antibodies 

The E. coli S30 extract (Promega) was prepared from
the B strain SL119. Synthesis reactions were contained in a
final volume of 50 µl and included 20 µl of complete premix, 15
µl of extract, 1 µl of rifampicin (1 mg/ml), 100 units of T7
RNA polymerase (Ambion), and 1.5 µl of template DNA as
indicated. Reactions were incubated at 37°C for either 30 min
to isolate polysomes or 60 min to synthesize soluble antibody.
To radiolabel mRNA, 10 µCi of α-[33P]UTP (Amersham, 3000
Ci/mmole) was included in the reaction and the incorporation of
label was quantitated by precipitating duplicate samples
with 10% trichloroacetic acid (TCA), counting in a liquid
scintillation counter and averaging the values.

Determination of soluble antibody binding by ELISA

The binding specificity of a soluble antibody
synthesized in vitro was determined by ELISA. In vitro
reactions were incubated in the presence or absence of pLM169
for 60 min, and then diluted ten-fold with cold PBS (10 mM
sodium phosphate pH 7.4, 140 mM NaCl, 2.7 mM KCl)/0.05% Tween-20
and placed on ice. Microtiter wells (Corning) were prepared
by incubating each well with 1 µg of diphtheria toxin
(Calbiochem) or bovine serum albumin (BSA) in PBS for 1 hr at
37°C, washing with PBS, blocking with PBS/1% BSA for 1 hr at
37°C and washing again with PBS. Duplicate portions of the
diluted in vitro reactions (100 µl) were added to the wells and
incubated for 1 hr at 4°C. Wells were washed 5 times with 250
µl of PBS, and the primary antibody (anti-E-tag (Pharmacia),
100 µl at 1 µg/ml in PBS/0.1% BSA/0.1% Tween) was added and
incubated for 1 hr at 4°C and washed as before. The plate was
developed by adding 100 µl of alkaline phosphatase-conjugated
goat anti-mouse antibody (Gibco, 1:1000 dilution in PBS/0.1%
BSA), incubating for 1 hr at 4°C, washed as before, and treated
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averaged. Fig. 8 graphically depicts the results. 
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Polysome isolation and binding of antibodies 
displayed on polysomes (Fig. 3). To isolate polysomes, the in
vitro reactions were incubated with either pLM166 or pLM153 and
the reactions were stopped by placing on ice and diluting four-fold
with polysome buffer (20 mM Hepes-OH pH 7.5, 10 mM MgCl2,
1.5 µg/ml chloramphenicol, 100 µg/ml acetylated bovine serum
albumin (BSA), 0.1% Tween-20). The diluted reactions were
centrifuged at 288,000 x g for 36 min at 4°C and the pellets
were resuspended in polysome buffer and centrifuged a second
time at 10,000 x g for 5 min to remove any insoluble material.

The labeled polysomes were quantitated [...] and
46,000 cpm of each polysome preparation was added to 150 µg of
magnetic beads (tosyl activated, Dynal) that had been coated
with either 0.75 µg of diphtheria toxin (Calbiochem) or Ab179
(kindly provided by Bruce Mortensen) as the negative control.
After binding for 1 hr at 4°C with end over end turning, the
beads were washed five times with polysome buffer and the mRNA
was eluted in 100 µl of elution buffer (polysome buffer
containing 20 mM EDTA). The recovered mRNA was TCA
precipitated and the radioactive counts determined, as shown in
Fig. 9.
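
The binding measurement above reduces to two simple quantities: the fraction of input counts recovered from each bead type, and the ratio of recovery on the specific antigen to recovery on the negative control. The sketch below shows that bookkeeping. Only the 46,000 cpm input comes from the text; the eluted counts are hypothetical placeholders and do not represent the data of Fig. 9.

```python
INPUT_CPM = 46_000  # counts added to each bead preparation (from the text)


def percent_recovered(eluted_cpm, input_cpm=INPUT_CPM):
    """Percentage of input polysome-associated counts recovered after elution."""
    return 100.0 * eluted_cpm / input_cpm


def specificity_ratio(target_cpm, control_cpm):
    """Fold difference between the specific antigen and the control bead."""
    if control_cpm <= 0:
        raise ValueError("control counts must be positive")
    return target_cpm / control_cpm


# Hypothetical eluted counts, for illustration only.
eluted = {"DT beads": 920, "Ab179 beads": 92}

for beads, cpm in eluted.items():
    print(f"{beads}: {percent_recovered(cpm):.2f}% of input recovered")
print("specificity ratio:", specificity_ratio(eluted["DT beads"], eluted["Ab179 beads"]))
```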

To facilitate correct folding of single-chain 
antibodies on polysomes, it is frequently desirable to incubate 
the polysomes in the presence of chaperones (e.g., GroEL or 
DnaK) prior to the binding (panning) step. To facilitate 
formation of disulfide bonds which are required for proper 
folding of a single-chain antibody, it is often desirable to 
incubate the polysome preparation in the presence of 0.2 mM
oxidized glutathione (GSSG), 2 mM reduced glutathione (GSH), and 1 µM
protein disulfide isomerase (PDI) for 15 minutes at 25-30°C
prior to adding the target macromolecule (or small molecule
epitope), and conducting the binding step at approximately 4°C.
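
The glutathione concentrations above correspond to a 10:1 ratio of reduced to oxidized glutathione. As a minimal sketch of how such a redox mix might be set up, the code below applies C1 x V1 = C2 x V2 to compute stock volumes. The final reaction volume and the stock concentrations are hypothetical values chosen for illustration; only the target concentrations come from the text.

```python
def stock_volume_ul(final_conc, stock_conc, final_volume_ul):
    """Volume of stock to add (C1*V1 = C2*V2); both concentrations in the same unit."""
    if stock_conc <= 0:
        raise ValueError("stock concentration must be positive")
    return final_conc * final_volume_ul / stock_conc


FINAL_VOLUME_UL = 100.0                    # hypothetical reaction volume
TARGETS_MM = {"GSSG": 0.2, "GSH": 2.0}     # final concentrations from the text
STOCKS_MM = {"GSSG": 10.0, "GSH": 100.0}   # hypothetical stock solutions

for name, target in TARGETS_MM.items():
    vol = stock_volume_ul(target, STOCKS_MM[name], FINAL_VOLUME_UL)
    print(f"{name}: add {vol:.1f} ul of {STOCKS_MM[name]:.0f} mM stock")

print("GSH:GSSG ratio =", TARGETS_MM["GSH"] / TARGETS_MM["GSSG"])
```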

Fig. 11 shows construction of a single-chain antibody 
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were amplified separately by PCR using the indicated primer 
sets. Equimolar portions were mixed and joined by PCR overlap 
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in the absence of primers. The full length segment was then 
amplified using primers ON3149 and ON2970. 
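
Joining segments by PCR overlap relies on the end of one fragment sharing sequence with the start of the next, so that the fragments prime each other in the absence of added primers. The sketch below models that idea on plain strings read on the same strand. The fragment sequences are short hypothetical placeholders, not the actual V region, spacer, or primer sequences of Fig. 11, and the function name is illustrative.

```python
def join_by_overlap(upstream, downstream, min_overlap=6):
    """Join two same-strand sequences where the 3' end of `upstream`
    matches the 5' end of `downstream`; the longest overlap is used."""
    longest = min(len(upstream), len(downstream))
    for k in range(longest, min_overlap - 1, -1):
        if upstream[-k:] == downstream[:k]:
            return upstream + downstream[k:]
    raise ValueError(f"no overlap of at least {min_overlap} nt")


# Hypothetical toy fragments; adjacent fragments share a 9-nt overlap.
vh = "ATGGCCCAGGTGGGTGGAGGC"
spacer = "GGTGGAGGCTCTGGCGGTGGC"
vl = "GGCGGTGGCGACATCGAGCTC"

scfv = join_by_overlap(join_by_overlap(vh, spacer), vl)
print(scfv, len(scfv))
```

The joined product would then serve as the template for amplification with outer primers, as described above for the full length segment.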

The foregoing description of the preferred
embodiments of the present invention has been presented for
purposes of illustration and description. They are not
intended to be exhaustive or to limit the invention to the
precise form disclosed, and many modifications and variations
are possible in light of the above teaching.

Such modifications and variations which may be
apparent to a person skilled in the art are intended to be
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CLAIMS: 

1. A method for identifying single chain antibodies
that bind to a predetermined antigen, said method comprising:

(1) translating in vitro a population of mRNA
molecules encoding a single-chain antibody segment with a
prokaryotic in vitro translation system forming a population of
polysomes displaying nascent single-chain antibodies;

(2) contacting said population of polysomes
with a predetermined antigen under aqueous binding conditions
compatible with intact polysomes and suitable for specific
antigen-antibody binding, thereby forming [...]
comprising polysomes displaying single-chain antibody bound to
antigen;

(3) separating intact polysomes bound to the
antigen from unbound polysomes by washing the bound polysomes
with a wash solution and removing unbound polysomes;

(4) dissociating the bound polysomes and
synthesizing cDNAs from the mRNA of the dissociated polysomes;
and

(5) thereby identifying the cDNAs as encoding
single chain antibodies that bind to the predetermined antigen.

2. A method of claim 1, wherein the prokaryotic in
vitro translation system is a coupled transcription/translation
system.

3. A method of claim 1 or 2, wherein the
prokaryotic in vitro translation system is an E. coli S30
system.

4. A method of claim 1, wherein the single chain
antibody is a scFv comprising a VH, spacer peptide, and VL.
6. A method of claim 1, comprising the further step
of cloning the cDNAs into a bacteriophage display vector.

7. A method of claim 6, wherein the bacteriophage
display vector is a filamentous bacteriophage vector and the
single chain antibody sequence isolated by polysome screening
is expressed as a fusion with a coat protein selected from pIII
or pVIII.

8. An improved method for identifying single chain
antibody sequences that bind to a predetermined antigen,
comprising:

(1) constructing in vitro a library of DNA
templates encoding a VH linked through a spacer peptide segment
to a VL suitable for in vitro transcription;

(2) introducing said library of DNA templates
directly into an in vitro transcription/translation system;

(3) transcribing said library of DNA templates
in vitro forming a population of mRNA molecules;

(4) translating in vitro said population of
mRNA molecules forming polysomes displaying nascent peptides
comprising single chain antibodies;

(5) contacting said polysomes with the
predetermined antigen under conditions suitable for specific binding
and compatible with intact polysomes;

(6) selecting polysomes which are bound to the
predetermined antigen and removing unbound polysomes by washing
with a suitable wash buffer and recovering polysomes bound to
the predetermined antigen;

(7) dissociating the bound polysomes and making
cDNAs from the mRNAs of the bound polysomes; and

(8) isolating the cDNAs.
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10. A method of claim 9, wherein the in vitro 
transcription and translation is performed with an E. coli S30 
system.

11. A method of claim 9, comprising the further step 
of cloning the cDNAs into a bacteriophage display vector and 
performing affinity screening on a resultant library of 
bacteriophage particles displaying single chain antibodies 
encoded by the cDNAs and fused to a bacteriophage coat protein. 



12. A method of claim [...], wherein a
subsequent round of screening comprises polysomes displaying an 
average display valency that is higher than average display 
valency of polysomes in a first screening round. 

13. The method of claim 8, comprising the further 
step of cloning the cDNAs into pAFF6 and forming bacteriophage 
particles displaying the single chain antibody sequences 
encoded by the cDNAs, affinity screening the bacteriophage
particles, and sequencing polynucleotides isolated from 
bacteriophage particles bound to an immobilized antigen used 
for said affinity screening. 

14. A composition comprising a population of 
polysomes displaying nascent single chain antibodies, wherein 
each nascent single chain antibody comprises a tether segment 
and a scFv segment. 

15. A composition of claim 14, wherein the scFv 
segment comprises a VH, spacer peptide, and VL.

16. A composition consisting essentially of 
polysomes displaying nascent peptides comprising a single chain 
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specificities of single chain antibodies, said method 
comprising: 
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contacting under suitable binding conditions a 
multiplicity of antigen species with a polysome library 
displaying nascent peptides having a single chain antibody 
segment; and 

separating polysomes bound to the antigen species 
from polysomes not bound to the antigen species; 

synthesizing cDNA from the separated bound polysomes, 
thereby identifying single chain antibodies which bind to at 
least one of the antigen species present in the multiplicity of 
antigens. 

18. A method of claim 17, wherein the multiplicity 
of antigen species comprises a library of beads or pins, each 
bead or pin having a single species of predetermined antigen. 

19. A method of claim 18, wherein the antigen 
species comprise polypeptides synthesized on the beads or pins. 

20. A method of claim 19, wherein the beads or pins 
individually comprise a discrete tag capable of reporting a 
sequence identity of the single species of polypeptide present
on the bead or pin. 

21. An improved method for identifying peptide 
sequences that bind to a predetermined receptor, said improved 
method comprising: 

(1) translating a population of mRNA molecules 
in vitro with a prokaryotic in vitro translation system forming 
a population of polysomes displaying nascent peptides; 

(2) contacting said population of polysomes 
with a predetermined immobilized receptor under aqueous binding 
conditions compatible with intact polysomes and suitable for 
specific binding; 
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(4) dissociating the bound polysomes and 
synthesizing cDNAs from the mRNA of the dissociated polysomes; 
and 

(5) determining sequences of the cDNAs. 

22. A method of claim 21, wherein the prokaryotic in
vitro translation system is a coupled transcription/translation
system.

23. A method of claim 22, wherein the prokaryotic in
vitro translation system is an E. coli S30 system.

24. A method of claim 21, wherein the aqueous binding
conditions and/or the wash solution comprise a non-ionic
detergent.

25. A method of claim 21, wherein the aqueous
binding conditions comprise a preblocking agent selected from
the group consisting of nonfat milk, bovine serum albumin,
tRNA, and gelatin.

26. A method of claim 21, comprising the further
step of cloning the cDNAs into a bacteriophage display vector.

27. A method of claim 26, wherein the bacteriophage
display vector is a filamentous bacteriophage vector and the
nascent peptide sequence isolated by polysome screening is
expressed as a fusion with a coat protein selected from pIII or
pVIII.

28. An improved method for identifying peptide
sequences that bind to a predetermined receptor, said improved
method comprising:

[...]
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(2) contacting said population of polysomes 
with a predetermined immobilized preblocked receptor under
aqueous binding conditions compatible with intact polysomes and
suitable for specific binding;

(3) separating intact polysomes bound to the
immobilized receptor from unbound polysomes by washing the
bound polysomes with a wash solution;

(4) dissociating the bound polysomes and
synthesizing cDNAs from the mRNA of the dissociated polysomes;
and

(5) determining sequences of the cDNAs.

29. A method of claim [...], wherein the
immobilized preblocked receptor has been preblocked with a
preblocking agent selected from the group consisting of: nonfat
milk, bovine serum albumin, tRNA, and gelatin.

30. An improved method for identifying peptide
sequences that bind to a predetermined receptor, comprising:

(1) constructing in vitro a library of DNA
templates suitable for in vitro transcription;

(2) introducing said library of DNA templates
directly into an in vitro transcription/translation system
without cloning said library in host cells;

(3) transcribing said library of DNA templates
in vitro forming a population of mRNA molecules;

(4) translating in vitro said population of
mRNA molecules forming polysomes displaying nascent peptides;

(5) contacting said polysomes with an
immobilized receptor under conditions suitable for specific
binding and compatible with intact polysomes;

(6) selecting polysomes which are bound to the
immobilized receptor and removing unbound polysomes by washing
with a suitable wash buffer and recovering polysomes bound to
the immobilized receptor;
31. A method of claim 30, further comprising the
step of purifying the polysomes by centrifugation prior to
contacting the polysomes with the receptor.

32. A method of claim 30, wherein the in vitro
transcription/translation system is an E. coli S30 system.

33. A method of claim 30, comprising the further
step of cloning the cDNAs into a bacteriophage display vector
and performing affinity screening on a resultant library of
bacteriophage particles displaying peptides encoded by the
cDNAs.

34. An improved method for identifying peptide
sequences that bind to a predetermined receptor, comprising:

(1) translating in vitro a population of mRNA
molecules forming polysomes displaying nascent peptides;

(2) centrifuging said polysomes and discarding
the supernatant by high speed centrifugation pelleting the
polysomes and discarding the supernatant, and recovering and
resolubilizing the polysome pellet containing the polysomes;

(3) contacting said polysomes with an
immobilized receptor under conditions suitable for specific
binding and compatible with intact polysomes;

(4) selecting polysomes which are bound to the
immobilized receptor and removing unbound polysomes by washing
with a suitable wash buffer and recovering polysomes bound to
the immobilized receptor;

(5) dissociating the bound polysomes and making
cDNAs from the mRNAs of the bound polysomes; and

(6) sequencing the cDNAs.

35. The method of claim 34, comprising the further
36. The method of claim 34, wherein the immobilized
receptor is preblocked with a blocking agent and a non-ionic
detergent is present in the binding and wash solutions.

37. The method of claim 36, comprising the further
step of cloning the cDNAs into pAFF6 and forming bacteriophage
particles displaying the peptide sequences encoded by the
cDNAs, affinity screening the bacteriophage particles, and
sequencing polynucleotides isolated from bacteriophage
particles bound to an immobilized receptor used for said
affinity screening.

38. A composition comprising a population of
polysomes displaying nascent peptides, wherein each nascent
peptide comprises a tether segment and a variable segment.

39. A composition of claim 38, wherein the tether
segment is a polypeptide segment which binds to RNA.

40. A composition of claim 39, wherein the
polypeptide segment binds to the Tar RNA sequence.

41. A composition of claim 38, wherein the tether
segment is biotinylated.
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